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TITLE : Novel Kallikrein Gene 
FIELD OF THE INVENTION 

5 The invention relates to nucleic acid molecules, proteins encoded by such nucleic acid molecules; and 

use of the proteins and nucleic acid molecules. 
BACKGROUND OF THE INVENTION 

Kallikreins are a group of serine proteases that are found in diverse tissues and biological fluids. The 
term "kallikrein" was first introduced by Werle and colleagues who found high levels of their original isolates 

10 in the pancreas (in Greek, the "kallikreas") (1,2). Kallikreins are divided into two main groups; the plasma 
kallikrein, which is a single gene (3), and the tissue kallikreins, which are encoded by a large multi-gene 
family in rodents (4,5). Until recently, the human kallikrein gene family was thought to consist of only three 
members (6). However, 11 new members of the kallikrein gene family have been identified (7-18). The 
progress in this area of investigation has recently been reviewed (7). 

1 5 Prostate specific antigen (PSA), currently the most useful tumor marker for prostate cancer diagnosis 

and monitoring, is a member of the human kallikrein gene family of serine proteases (19,20). In addition to 
PSA, human glandular kallikrein 2 (hK2, encoded by the KLK2 gene) has been proposed as an adjuvant 
diagnostic marker for prostate cancer (21,22). Moreover, accumulating evidence indicates that other members 
of the expanded kallikrein gene family may be associated with malignancy (7). The normal epithelial cell- 

2 0 specific 1 gene (NES1) (KLK10, according to the approved human tissue kallikrein gene nomenclature) was 

found to be a novel tumor suppressor, which is down-regulated during breast cancer progression (23). Other 
gene family members, including zyme (KLK6), neuropsin (KLK8), and human stratum corneum chymotyrptic 
en2yme (HSCCE; KLK7) were also found to be differentially expressed in certain types of malignancies (24- 
26). 

25 SUMMARY OF THE INVENTION 

The present inventors identified a nucleic acid molecule encoding a novel kallikrein. The nucleic acid 
molecule maps to chromosome 19ql3.3-ql3.4 and is located between the klkl and klk3 genes. The novel 
nucleic acid molecule designated "klkl 5" has three alternatively spliced forms and is primarily expressed in 
the thyroid gland, and to a lower extent in the prostate, salivary and adrenal glands, colon, testis, and kidney. 

3 0 The expression of the nucleic acid is up-regulatedin prostate cancer and it is under steroid hormone regulation 

in the LNCaP prostate cancer cell line. Higher expression of klkl 5 is associated with more aggressive (higher 
stage and higher grade) prostate tumors. 

The novel kallikrein protein described herein is referred to as "Kallikrein 15", "KLK15", or "KLK15 
Protein". The gene encoding the protein is referred to as "klkl 5". 
35 Broadly stated the present invention relates to an isolated nucleic acid molecule of at least 30 

nucleotides which hybridizes to one or more of SEQ. ID. NO. 1 through 5, or 10 through 24, or the 
complement of one or more of SEQ ID NO. 1 through 5, or 10 through 24 under stringent hybridization 
conditions. 

The invention also contemplates a nucleic acid molecule comprising a sequence encoding a truncation 
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of a KLK15 Protein, an analog, or a homolog of a KLK15 Protein or a truncation thereof. (KLK15 Protein 
and truncations, analogs and homologs of KLK15 Protein are also collectively referred to herein as " KLK15 
Related Proteins"). 

The nucleic acid molecules of the invention may be inserted into an appropriate expression vector, 
5 i.e. a vector that contains the necessary elements for the transcription and translation of the inserted coding 
sequence. Accordingly, recombinant expression vectors adapted for transformation of a host cell may be 
constructed which comprise a nucleic acid molecule of the invention and one or more transcription and 
translation elements linked to the nucleic acid molecule. 

The recombinant expression vector can be used to prepare transformed host cells expressing KLK15 
1 0 Related Proteins. Therefore, the invention further provides host cells containing a recombinant molecule of the 
invention. The invention also contemplates transgenic non-human mammals whose germ cells and somatic cells 
contain a recombinant molecule comprising a nucleic acid molecule of the invention, in particular one which 
encodes an analog of the KLK15 Protein, or a truncation of the KLK15 Protein. 

The invention further provides a method for preparing KLK15 Related Proteins utilizing the purified 
1 5 and isolated nucleic acid molecules of the invention. In an embodiment a method for preparing a KLK15 
Related Protein is provided comprising (a) transferring a recombinant expression vector of the invention into 
a host cell; (b) selecting transformed host cells from untransformed host cells; (c) culturing a selected 
transformed host cell under conditions which allow expression of the KLK15 Related Protein; and (d) isolating 
the KLK15 Related Protein. 

2 0 The invention further broadly contemplates an isolated KLK15 Protein comprising an amino acid 

sequence of SEQ.ID.NO. 6, 7, 8, or 9. 

The KLK15 Related Proteins of the invention may be conjugated with other molecules, such as 
proteins, to prepare fusion proteins. This may be accomplished, for example, by the synthesis of N-terminal 
or C-terminal fusion proteins. 

2 5 The invention further contemplates antibodies having specificity against an epitope of a KLK15 

Related Protein of the invention. Antibodies may be labeled with a detectable substance and used to detect 
proteins of the invention in tissues and cells. Antibodies may have particular use in therapeutic applications, 
for example to react with tumor cells, and in conjugates and immunotoxins as target selective carriers of various 
agents which have antitumor effects including chemotherapeutic drugs, toxins, immunological response 

3 0 modifiers, enzymes, and radioisotopes. 

The invention also permits the construction of nucleotide probes that are unique to the nucleic acid 
molecules of the invention and/or to proteins of the invention. Therefore, the invention also relates to a probe 
comprising a nucleic acid sequence of the invention, or a nucleic acid sequence encoding a protein of the 
invention, or a part thereof. The probe may be labeled, for example, with a detectable substance and it may be 
3 5 used to select from a mixture of nucleotide sequences a nucleic acid molecule of the invention including nucleic 
acid molecules coding for a protein which displays one or more of the properties of a protein of the invention. 
A probe may be used to mark tumors. 
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The invention also provides antisense nucleic acid molecules e.g. by production of a mRNA or DNA 
strand in the reverse orientation to a sense molecule. An antisense nucleic acid molecule may be used to 
suppress the growth of a KLK15 expressing (e.g. cancerous) cell. 

The invention still further provides a method for identifying a substance that binds to a protein of the 
5 invention comprising reacting the protein with at least one substance which potentially can bind with the 
protein, under conditions which permit the formation of complexes between the substance and protein and 
detecting binding. Binding may be detected by assaying for complexes, for free substance, or for non- 
complexed protein. The invention also contemplates methods for identifying substances that bind to other 
intracellular proteins that interact with a KLK15 Related Protein. Methods can also be utilized which identify 
1 0 compounds which bind to KLK15 gene regulatory sequences (e.g. promoter sequences). 

Still further the invention provides a method for evaluating a compound for its ability to modulate the 
biological activity of a KLK15 Related Protein of the invention. For example a substance which inhibits or 
enhances the interaction of the protein and a substance which binds to the protein may be evaluated. In an 
embodiment, the method comprises providing a known concentration of a KLK15 Related Protein, with a 
1 5 substance which binds to the protein and a test compound under conditions which permit the formation of 
complexes between the substance and protein, and removing and/or detecting complexes. 

Compounds which modulate the biological activity of a protein of the invention may also be identified 
using the methods of the invention by comparing the pattern and level of expression of the protein of the 
invention in tissues and cells, in the presence, and in the absence of the compounds. 
2 0 The proteins of the invention, antibodies, antisense nucleic acid molecules, and substances and 

compounds identified using the methods of the invention, and peptides of the invention may be used to 
modulate the biological activity of a KLK15 Related Protein of the invention, and they may be used in the 
treatment of conditions such as cancer (particularly prostate, colon, kidney, and testicular cancer) and thyroid 
disorders in a subject. Accordingly, the substances and compounds may be formulated into compositions for 

2 5 administration to individuals suffering from disorders such as cancer (particularly particularly prostate, colon, 

kidney, and testicular cancer) and thyroid disorders in a subject. In particular, the antibodies, antisense nucleic 
acid molecules, substances and compounds may be used to treat patients who have a KLK15 Related Protein 
in, or on, their cancer cells. 

Therefore, the present invention also relates to a composition comprising one or more of a protein of 

3 0 the invention, or a substance or compound identified using the methods of the invention, and a 

pharmaceutically acceptable carrier, excipient or diluent. A method for treating or preventing a disorder such 
as cancer (particularly prostate, thyroid, colon, kidney, and testicular cancer) and thyroid disorders in a subject 
is also provided comprising administering to a patient in need thereof, a KLK15 Related Protein of the 
invention, a substance or compound identified using the methods of the invention, or a composition of the 
3 5 invention. 

Another aspect of the invention is the use of a KLK15 Related Protein, peptides derived therefrom, 
or chemically produced (synthetic) peptides, or any combination of these molecules, for use in the preparation 
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of vaccines to prevent cancer and/or to treat cancer, in particular to prevent and/or treat cancer in patients who 
have a KLK15 Related Protein detected on their cells. These vaccine preparations may also be used to prevent 
patients from having tumors prior to their occurrence. 

The invention broadly contemplates vaccines for stimulating or enhancing in a subject to whom the 
5 vaccine is administered production of antibodies directed against a KLK15 Related Protein. 

The invention also provides a method for stimulating or enhancing in a subject production of 
antibodies directed against a KLK15 Related Protein. The method comprises administering to the subject a 
vaccine of the invention in a dose effective for stimulating or enhancing production of the antibodies. 

The invention further provides methods for treating, preventing, or delaying recurrence of cancer. The 
1 0 methods comprise administering to the subject a vaccine of the invention in a dose effective for treating, 
preventing, or delaying recurrence of cancer. 

In other embodiments, the invention provides a method for identifying inhibitors of a KLK15 Related 
Protein interaction, comprising 

(a) providing a reaction mixture including the KLK15 Related Protein and a substance that binds 
15 to the KLK15 Related Protein, or at least a portion of each which interact; 

(b) contacting the reaction mixture with one or more test compounds; 

(c) identifying compounds which inhibit the interaction of the KLK15 Related Protein and 
substance. 

In certain preferred embodiments, the reaction mixture is a whole cell. In other embodiments, the 
2 0 reaction mixture is a cell lysate or purified protein composition. The subject method can be carried out using 
libraries of test compounds. Such agents can be proteins, peptides, nucleic acids, carbohydrates, small organic 
molecules, and natural product extract libraries, such as those isolated from animals, plants, fungus and/or 
microbes. Still another aspect of the present invention provides a method of conducting a drug 

discovery business comprising: 

2 5 (a) providing one or more assay systems for identifying agents by their ability to inhibit or 

potentiate the interaction of a KLK15 Related Protein and a substance that binds to the protein; 

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, 
for efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step (b) 

3 0 as having an acceptable therapeutic profile. 

In certain embodiments, the subject method can also include a step of establishing a distribution 
system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a sales 
group for marketing the pharmaceutical preparation. 

Other objects, features and advantages of the present invention will become apparent from the 
3 5 following detailed description. It should be understood, however, that the detailed description and the specific 
examples while indicating preferred embodiments of the invention are given by way of illustration only, since 
various changes and modifications within the spirit and scope of the invention will become apparent to those 
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skilled in the art from this detailed description. 
BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the drawings in which: 

Figure 1 shows the genomic organization and partial genomic sequence of the KLK15 gene. Intronic 
5 sequences are not shown except for the splice junction areas. Introns are shown with lower case letters and 
exons with capital letters. The coding nucleotides are shown in bold and the 3' untranslated region follows the 
TGA stop codon (encircled). The translated amino acids of the coding region are shown underneath by a single 
letter abbreviation. The start and stop codons are encircled and the exon -intron junctions are underlined. The 
catalytic residues are boxed. The putative polyadenylation signal is underlined. The exact start of the first 

1 0 coding exon was not determined. 

Figure 2 shows an alignment of the deduced amino acid sequence of KLK15 with members of the 
kallikrein multi-gene family (SEQ ID NOs. 25-38). Dashes represent gaps to bring the sequences to better 
alignment. The residues of the catalytic triad (H, D, S) are shown in italics. Identical amino acids are 
highlighted in black and similar residues in grey. The 29 invariant serine protease residues are marked by ( • ) 

15 on the bottom, and the cysteine residues by (+) on top of each block. The predicted cleavage sites of the signal 
and activation peptides are indicated by arrows. The dotted area represents the kallikrein loop sequence. The 
trypsin-like cleavage pattern predicted by the presence of the "D" residue is indicated by (*). KLK15 has an 
"E" in this position. A unique 8 amino acid sequence, HNEPGTAG (SEQ ID NO. 10), is present at positions 
148-155 of the KLK15 gene. 

2 0 Figure 3 is a plot of hydrophobicity and hydrophilicity of the KLK15 protein, as compared with the 

prostate specific antigen (PSA). Note the hydrophobic region at the amino terminus, suggesting presence of 
a signal peptide. 

Figure 4 is a dendrogram of the predicted phylogenetic tree for 15 kallikreins and a few other serine 
proteases. The neighbor-joining method was used to align KLK15 with other serine proteases and members 
25 of the kallikrein gene family. The tree grouped the classical kallikreins (hKl, hK2, and PSA) together and 
aligned KLK15 in one group with TLSP and KLK-L3 genes. Other serine proteases were aligned in different 
groups, as shown. KLK represents kallikrein; KLK-L represents kallikrein-like; TLSP represents trypsin-like 
serine protease; NES1 represents normal epithelial cell-specific gene; PSA represents prostate specific antigen; 
hKl and hK2 represents human glandular kallikrein 1 and 2, respectively; and HSCCE represents human 

3 0 stratum corneum chymotryptic enzyme. 

Figure 5 is a schematic presentation of the different splice variants of the KLK15 gene. Exons are 
shown by boxes and introns by the connecting lines. Numbers inside boxes represent the exon lengths in base 
pairs. The arrowhead points to the common start codon and stars to the stop codon positions. The length of the 
predicted polypeptide product is indicated beside each variant in amino acids (AA). The alternative splicing 
3 5 and/or exon skips create a frame shift, which leads to a premature termination. 

Figure 6 shows the relative locations of KLK1, KLK15, and KLK3 genes on chromosome 19ql3.3- 
ql3.4. The two overlapping BAC clones are identified, and the overlap region is hatched. Genes are 
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represented by horizontal arrows denoting the direction of the coding sequence. Distances between genes are 
mentioned in base pairs. Figure is not drawn to scale. 

Figure 7 shows tissue expression of the KLK15 gene, as determined by RT-PCR. KLK15 is primarily 
expressed in the thyroid gland, and to a lower extent in the prostate, salivary and adrenal glands, colon, testis 
5 and kidney. M= Molecular weight marker. For explanation of the multiple PCR bands (alternatively spliced 
forms) see the Example. PCR was performed with primers KLK15-F2 and KLK15-R1. 

Figure 8 shows hormonal regulation of the KLK15 gene in the LNCaP prostate cancer cell line. DHT 
= dihydrotestosterone. Steroids were added at 10" 8 M final concentrations, (-ve) = negative control. Actin was 
used as a control gene. 

1 0 Figure 9 is a schematic diagram showing the comparison of the coding regions of the 15 kallikrein 

genes. Exons are shown by solid bars and introns by the connecting lines. Letters above boxes indicate relative 
positions of the catalytic triad that was found to be conserved in all genes; H denotes histidine, D aspartic acid 
and S serine. Roman numbers indicate intron phases. The intron phase refers to the location of the intron 
within the codon; I denotes that the intron occurs after the first nucleotide of the codon, II the intron occurs 

1 5 after the second nucleotide, 0 the intron occurs between codons. The intron phases are conserved in all genes. 
Numbers inside boxes indicate exon lengths in base pairs. Names inside brackets represent the official 
nomenclature approved by the human gene nomenclature committee. Untranslated 3' and 5' regions and 5' 
untranslated exons are not shown. 
DETAILED DESCRIPTION OF THE INVENTION 

2 0 In accordance with the present invention there may be employed conventional molecular biology, 

microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained fully 
in the literature. See for example, Sambrook, Fritsch, & Maniatis, Molecular Cloning: A Laboratory Manual, 
Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y); DNA Cloning: A 
Practical Approach, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M..J. Gait ed, 1984); 

2 5 Nucleic Acid Hybridization B.D. Hames & S.J. Higgins eds. (1985); Transcription and Translation B.D. Hames 

& S.J. Higgins eds (1984); Animal Cell Culture R.I. Freshney, ed. (1986); Immobilized Cells and enzymes IRL 
Press, (1986); and B. Perbal, A Practical Guide to Molecular Cloning (1984). 
1. Nucleic Acid Molecules of the Invention 

As hereinbefore mentioned, the invention provides an isolated nucleic acid molecule having a 

3 0 sequence encoding a KLK15 Protein. The term "isolated" refers to a nucleic acid substantially free of cellular 

material or culture medium when produced by recombinant DNA techniques, or chemical reactants, or other 
chemicals when chemically synthesized. An "isolated" nucleic acid may also be free of sequences which 
naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid molecule) from 
which the nucleic acid is derived. The term "nucleic acid" is intended to include DNA and RNA and can be 
3 5 either double stranded or single stranded. In an embodiment, a nucleic acid molecule of the invention encodes 
a protein comprising an amino acid sequence of SEQ.ID.NO. 6, 7, 8, or 9 preferably a nucleic acid molecule 
of the invention comprises a nucleic acid sequence of one or more of SEQ.ID.NO. 1 through 5, or 10 through 
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In an embodiment, the invention provides an isolated nucleic acid molecule which comprises: 

(i) a nucleic acid sequence encoding a protein having substantial sequence identity with an 
amino acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; 

5 (ii) a nucleic acid sequence encoding a protein comprising an amino acid sequence of SEQ. ID. 

NO. 6, 7, 8, or 9; 

(iii) nucleic acid sequences complementary to (i) or (ii); 

(iv) a degenerate form of a nucleic acid sequence of (i) or (ii); 

(v) a nucleic acid sequence capable of hybridizing under stringent conditions to a nucleic acid 
1 0 sequence in (i), (ii) or (iii); 

(vi) a nucleic acid sequence encoding a truncation, an analog, an allelic or species variation of 
a protein comprising an amino acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; or 

(vii) a fragment, or allelic or species variation of (i), (ii) or (iii). 

Preferably, a purified and isolated nucleic acid molecule of the invention comprises: 
15 (i) a nucleic acid sequence comprising the sequence of one or more of SEQ.ID.NO. 1 through 

5 or 10 through 24,wherein T can also be U; 

(ii) nucleic acid sequences complementary to (i), preferably complementary to the full nucleic 
acid sequence of one or more of SEQ.ID.NO. 1 through 5 or 10 through 24; 

(iii) a nucleic acid capable of hybridizing under stringent conditions to a nucleic acid of (i) or (ii) 
2 0 and preferably having at least 18 nucleotides; or 

(iv) a nucleic acid molecule differing from any of the nucleic acids of (i) to (iii) in codon 
sequences due to the degeneracy of the genetic code. 

The invention includes nucleic acid sequences complementary to a nucleic acid encoding a protein 
comprising an amino acid sequence of SEQ.ID.NO. 6, 7, 8, or 9 preferably the nucleic acid sequences 

2 5 complementary to a full nucleic acid sequence of one or more of SEQ.ID.NO. 1 through 5 or 10 through 24. 

The invention includes nucleic acid molecules having substantial sequence identity or homology to 
nucleic acid sequences of the invention or encoding proteins having substantial identity or similarity to the 
amino acid sequence of SEQ.ID.NO. 6, 7, 8, or 9. Preferably, the nucleic acids have substantial sequence 
identity for example at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, or 85% nucleic acid identity; more 

3 0 preferably 90% nucleic acid identity; and most preferably at least 95%, 96%, 97%, 98%, or 99% sequence 

identity. "Identity" as known in the art and used herein, is a relationship between two or more amino acid 
sequences or two or more nucleic acid sequences, as determined by comparing the sequences. It also refers to 
the degree of sequence relatedness between amino acid or nucleic acid sequences, as the case may be, as 
determined by the match between strings of such sequences. Identity and similarity are well known terms to 
3 5 skilled artisans and they can be calculated by conventional methods (for example see Computational Molecular 
Biology, Lesk, A.M. ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome 
Projects, Smith, D.W. ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, 
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Griffin, A.M. and Griffin, H.G. eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G. Acadmeic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 
J. eds. M. Stockton Press, New York, 1991, Carillo, H. and Lipman, D., SIAM J. Applied Math. 48:1073, 
1988). Methods which are designed to give the largest match between the sequences are generally preferred. 
5 Methods to determine identity and similarity are codified in publicly available computer programs including 
the GCG program package (Devereux J. et al, Nucleic Acids Research 12(1): 387, 1984); BLASTP, BLASTN, 
and FASTA (Atschul, S.F. et al. J. Molec. Biol. 215: 403-410, 1990). The BLAST X program is publicly 
available from NCBI and other sources (BLAST Manual, Altschul, S. et al. NCBI NLM NIH Bethesda, Md. 
20894; Altschul, S. et al. J. Mol. Biol. 215: 403-410, 1990). 

1 0 Isolated nucleic acid molecules encoding a KLK15 Protein, and having a sequence which differs from 

a nucleic acid sequence of the invention due to degeneracy in the genetic code are also within the scope of the 
invention. Such nucleic acids encode functionally equivalent proteins (e.g. a KLK15 Protein) but differ in 
sequence from the sequence of a KLK15 Protein due to degeneracy in the genetic code. As one example, DNA 
sequence polymorphisms within the nucleotide sequence of a KLK15 Protein may result in silent mutations 

15 which do not affect the amino acid sequence. Variations in one or more nucleotides may exist among 
individuals within a population due to natural allelic variation. Any and all such nucleic acid variations are 
within the scope of the invention. DNA sequence polymorphisms may also occur which lead to changes in the 
amino acid sequence of a KLK15 Protein. These amino acid polymorphisms are also within the scope of the 
present invention. 

2 0 Another aspect of the invention provides a nucleic acid molecule which hybridizes under stringent 

conditions, preferably high stringency conditions to a nucleic acid molecule which comprises a sequence which 
encodes a KLK15 Protein having an amino acid sequence shown in SEQ.ID.NO. 6, 7, 8, or 9. Appropriate 
stringency conditions which promote DNA hybridization are known to those skilled in the art, or can be found 
in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, 6.0 x 
25 sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at 50°C may be 
employed. The stringency may be selected based on the conditions used in the wash step. By way of example, 
the salt concentration in the wash step can be selected from a high stringency of about 0.2 x SSC at 50°C. In 
addition, the temperature in the wash step can be at high stringency conditions, at about 65°C. 

It will be appreciated that the invention includes nucleic acid molecules encoding a KLK15 Related 

3 0 Protein including truncations of a KLK15 Protein, and analogs of a KLK15 Protein as described herein. The 

truncated nucleic acids or nucleic acid fragments may correspond to a sequence comprising or consisting of 
nucleotides 1581-1623, 1524-5258, 5259-5412, 5413-5912, 5913-6078, 6197-6316, 6079-6316, 6317-6453, 
6454-7126, 6079-7126, 7127-7786, 5913-6196, 7127-7131, or 7127-7279 of SEQ ID NO. 1, or SEQ ID NO. 
39, 47, 48, 49, or 50. It will further be appreciated that variant forms of the nucleic acid molecules of the 
3 5 invention which arise by alternative splicing of an mRNA corresponding to a cDNA of the invention are 
encompassed by the invention (See SEQ ID NO. 3, 4, and 5). 

An isolated nucleic acid molecule of the invention which comprises DNA can be isolated by preparing 
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a labelled nucleic acid probe based on all or part of a nucleic acid sequence of the invention. The labeled 
nucleic acid probe is used to screen an appropriate DNA library (e.g. a cDNA or genomic DNA library). For 
example, a cDNA library can be used to isolate a cDNA encoding a KLK15 Related Protein by screening the 
library with the labeled probe using standard techniques. Alternatively, a genomic DNA library can be similarly 
5 screened to isolate a genomic clone encompassing a gene encoding a KLK15 Related Protein. Nucleic acids 
isolated by screening of a cDNA or genomic DNA library can be sequenced by standard techniques. 

An isolated nucleic acid molecule of the invention which is DNA can also be isolated by selectively 
amplifying a nucleic acid encoding a KLK15 Related Protein using the polymerase chain reaction (PCR) 
methods and cDNA or genomic DNA. It is possible to design synthetic oligonucleotide primers from the 

1 0 nucleotide sequence of the invention for use in PCR. A nucleic acid can be amplified from cDNA or genomic 
DNA using these oligonucleotide primers and standard PCR amplification techniques. The nucleic acid so 
amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. cDNA may 
be prepared from mRNA, by isolating total cellular mRNA by a variety of techniques, for example, by using 
the guanidinium-thiocyanate extraction procedure of Chirgwin et al., Biochemistry, 18, 5294-5299 (1979). 

1 5 cDNA is then synthesized from the mRNA using reverse transcriptase (for example, Moloney MLV reverse 
transcriptase available from Gibco/BRL, Bethesda, MD, or AMV reverse transcriptase available from 
Seikagaku America, Inc., St. Petersburg, PL). 

An isolated nucleic acid molecule of the invention which is RNA can be isolated by cloning a cDNA 
encoding a KLK15 Related Protein into an appropriate vector which allows for transcription of the cDNA to 

2 0 produce an RNA molecule which encodes a KLK15 Related Protein. For example, a cDNA can be cloned 
downstream of a bacteriophage promoter, (e.g. a T7 promoter) in a vector, cDNA can be transcribed in vitro 
with T7 polymerase, and the resultant RNA can be isolated by conventional techniques. 

Nucleic acid molecules of the invention may be chemically synthesized using standard techniques. 
Methods of chemically synthesizing polydeoxynucleotides are known, including but not limited to solid-phase 

2 5 synthesis which, like peptide synthesis, has been fully automated in commercially available DNA synthesizers 

(See e.g., Itakura et al. U.S. Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. 
Patent Nos. 4,401,796 and 4,373,071). 

Determination of whether a particular nucleic acid molecule encodes a KLK15 Related Protein can 
be accomplished by expressing the cDNA in an appropriate host cell by standard techniques, and testing the 

3 0 expressed protein in the methods described herein. A cDNA encoding a KLK15 Related Protein can be 

sequenced by standard techniques, such as dideoxynucleotide chain termination or Maxam-Gilbert chemical 
sequencing, to determine the nucleic acid sequence and the predicted amino acid sequence of the encoded 
protein. 

The initiation codon and untranslated sequences of a KLK15 Related Protein may be determined using 
3 5 computer software designed for the purpose, such as PC/Gene (IntelliGenetics Inc., Calif.). The intron-exon 
structure and the transcription regulatory sequences of a gene encoding a KLK15 Related Protein may be 
confirmed by using a nucleic acid molecule of the invention encoding a KLK15 Related Protein to probe a 
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genomic DNA clone library. Regulatory elements can be identified using standard techniques. The function 
of the elements can be confirmed by using these elements to express a reporter gene such as the lacZ gene that 
is operatively linked to the elements. These constructs may be introduced into cultured cells using conventional 
procedures or into non-human transgenic animal models. In addition to identifying regulatory elements in DNA, 
5 such constructs may also be used to identify nuclear proteins interacting with the elements, using techniques 
known in the art. In an embodiment, regulatory sequences of a nucleic acid molecule of the invention comprise 
the sequence of SEQ ID NO. 1 1 . 

In a particular embodiment of the invention, the nucleic acid molecules isolated using the methods 
described herein are mutant KLK15 gene alleles. The mutant alleles may be isolated from individuals either 

1 0 known or proposed to have a genotype which contributes to the symptoms of a disorder involving a KLK15 
Related Protein. Mutant alleles and mutant allele products may be used in therapeutic and diagnostic methods 
described herein. For example, a cDNA of a mutant KLK15 gene may be isolated using PCR as described 
herein, and the DNA sequence of the mutant allele may be compared to the normal allele to ascertain the 
mutation(s) responsible for the loss or alteration of function of the mutant gene product. A genomic library can 

1 5 also be constructed using DNA from an individual suspected of or known to carry a mutant allele, or a cDNA 
library can be constructed using RNA from tissue known, or suspected to express the mutant allele. A nucleic 
acid encoding a normal KLK15 gene or any suitable fragment thereof, may then be labeled and used as a probe 
to identify the corresponding mutant allele in such libraries. Clones containing mutant sequences can be 
purified and subjected to sequence analysis. In addition, an expression library can be constructed using cDNA 

2 0 from RNA isolated from a tissue of an individual known or suspected to express a mutant KLK15 allele. Gene 
products made by the putatively mutant tissue may be expressed and screened, for example using antibodies 
specific for a KLK15 Related Protein as described herein. Library clones identified using the antibodies can 
be purified and subjected to sequence analysis. 

The sequence of a nucleic acid molecule of the invention, or a fragment of the molecule, may be 

2 5 inverted relative to its normal presentation for transcription to produce an antisense nucleic acid molecule. An 

antisense nucleic acid molecule may be constructed using chemical synthesis and enzymatic ligation reactions 
using procedures known in the art. 
2. Proteins of the Invention 

An amino acid sequence of a KLK15 Protein comprises a sequence as shown in SEQ.ED.NO. 6, 7, 

3 0 8, or 9. The protein is primarily expressed in the thyroid gland, and to a lower extent in the prostate, salivary 

and adrenal glands, colon, testis, and kidney. 

In addition to proteins comprising an amino acid sequence as shown in SEQ.ID.NO. 6, 7, 8, or 9 the 
proteins of the present invention include truncations of a KLK15 Protein, analogs of a KLK15 Protein, and 
proteins having sequence identity or similarity to a KLK15 Protein, and truncations thereof as described 
3 5 herein (i.e. KLK15 Related Proteins). 

Truncated proteins may comprise peptides of between 3 and 70 amino acid residues, ranging in size 
from a tripeptide to a 70 mer polypeptide. In a preferred embodiment the peptide is HNEPGTAG (SEQ ID NO 
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10). The truncated proteins may have an amino group (-NH2), a hydrophobic group (for example, 
carbobenzoxyl, dansyl, or T-butyloxycarbonyl), an acetyl group, a 9-fluorenylmethoxy-carbonyl (PMOC) 
group, or a macromolecule including but not limited to lipid-fatty acid conjugates, polyethylene glycol, or 
carbohydrates at the amino terminal end. The truncated proteins may have a carboxyl group, an amido group, 
5 a T-butyloxycarbonyl group, or a macromolecule including but not limited to lipid-fatty acid conjugates, 
polyethylene glycol, or carbohydrates at the carboxy terminal end. 

The proteins of the invention may also include analogs of a KLK15 Protein, and/or truncations thereof 
as described herein, which may include, but are not limited to a KLK15 protein, containing one or more amino 
acid substitutions, insertions, and/or deletions. Amino acid substitutions may be of a conserved or non- 

1 0 conserved nature. Conserved amino acid substitutions involve replacing one or more amino acids of a KLK15 
Protein amino acid sequence with amino acids of similar charge, size, and/or hydrophobicity characteristics. 
When only conserved substitutions are made the resulting analog is preferably functionally equivalent to a 
KLK15 Protein. Non-conserved substitutions involve replacing one or more amino acids of the KLK15 Protein 
amino acid sequence with one or more amino acids that possess dissimilar charge, size, and/or hydrophobicity 

1 5 characteristics. 

One or more amino acid insertions may be introduced into a KLK15 Protein. Amino acid insertions 
may consist of single amino acid residues or sequential amino acids ranging from 2 to 15 amino acids in length. 

Deletions may consist of the removal of one or more amino acids, or discrete portions from a KLK15 
Protein sequence. The deleted amino acids may or may not be contiguous. The lower limit length of the 
2 0 resulting analog with a deletion mutation is about 10 amino acids, preferably 20 to 40 amino acids. 

The proteins of the invention include proteins with sequence identity or similarity to a KLK15 Protein 
and/or truncations thereof as described herein. Such KLK15 Proteins include proteins whose amino acid 
sequences are comprised of the amino acid sequences of KLK15 Protein regions from other species that 
hybridize under selected hybridization conditions (see discussion of stringent hybridization conditions herein) 

2 5 with a probe used to obtain a KLK15 Protein. These proteins will generally have the same regions which are 

characteristic of a KLK15 Protein. Preferably a protein will have substantial sequence identity for example, 
about 55%, 60%, 65%, 70%, 75%, 80%, or 85% identity, preferably 90% identity, more preferably at least 
95%, 96%, 97%, 98%, or 99% identity, and most preferably 98% identity with an amino acid sequence of 
SEQ.ID.NO. 6, 7, 8, or 9. A percent amino acid sequence homology, similarity or identity is calculated as the 

3 0 percentage of aligned amino acids that match the reference sequence using known methods as described herein. 

The invention also contemplates isoforms of the proteins of the invention. An isoform contains the 
same number and kinds of amino acids as a protein of the invention, but the isoform has a different molecular 
structure. Isoforms contemplated by the present invention preferably have the same properties as a protein of 
the invention as described herein. 
3 5 The present invention also includes KLK15 Related Proteins conjugated with a selected protein, or 

a marker protein (see below) to produce fusion proteins. Additionally, immunogenic portions of a KLK15 
Protein and a KLK15 Protein Related Protein are within the scope of the invention. 
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A KLK15 Related Protein of the invention may be prepared using recombinant DNA methods. 
Accordingly, the nucleic acid molecules of the present invention having a sequence which encodes a KLK15 
Related Protein of the invention may be incorporated in a known manner into an appropriate expression vector 
which ensures good expression of the protein. Possible expression vectors include but are not limited to 
5 cosmids, plasmids, or modified viruses (e.g. replication defective retroviruses, adenoviruses and adeno- 
associated viruses), so long as the vector is compatible with the host cell used. 

The invention therefore contemplates a recombinant expression vector of the invention containing a 
nucleic acid molecule of the invention, and the necessary regulatory sequences for the transcription and 
translation of the inserted protein-sequence. Suitable regulatory sequences may be derived from a variety of 

10 sources, including bacterial, fungal, viral, mammalian, or insect genes [For example, see the regulatory 
sequences described in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990)]. Selection of appropriate regulatory sequences is dependent on the host cell chosen as 
discussed below, and may be readily accomplished by one of ordinary skill in the art. The necessary regulatory 
sequences may be supplied by the native KLK15 Protein and/or its flanking regions. 

1 5 The invention further provides a recombinant expression vector comprising a DNA nucleic acid 

molecule of the invention cloned into the expression vector in an antisense orientation. That is, the DNA 
molecule is linked to a regulatory sequence in a manner which allows for expression, by transcription of the 
DNA molecule, of an RNA molecule which is antisense to the nucleic acid sequence of a protein of the 
invention or a fragment thereof. Regulatory sequences linked to the antisense nucleic acid can be chosen which 

2 0 direct the continuous expression of the antisense RNA molecule in a variety of cell types, for instance a viral 

promoter and/or enhancer, or regulatory sequences can be chosen which direct tissue or cell type specific 
expression of antisense RNA. 

The recombinant expression vectors of the invention may also contain a marker gene which facilitates 
the selection of host cells transformed or transfected with a recombinant molecule of the invention. Examples 
25 of marker genes are genes encoding a protein such as G418 and hygromycin which confer resistance to certain 
drugs, P-galactosidase, chloramphenicol acetyltransferase, firefly luciferase, or an immunoglobulin or portion 
thereof such as the Fc portion of an immunoglobulin, preferably IgG. The markers can be introduced on a 
separate vector from the nucleic acid of interest. 

The recombinant expression vectors may also contain genes that encode a fusion moiety which 

3 0 provides increased expression of the recombinant protein; increased solubility of the recombinant protein; and 

aid in the purification of the target recombinant protein by acting as a ligand in affinity purification. For 
example, a proteolytic cleavage site may be added to the target recombinant protein to allow separation of the 
recombinant protein from the fusion moiety subsequent to purification of the fusion protein. Typical fusion 
expression vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New England Biolabs, 
3 5 Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose 
E binding protein, or protein A, respectively, to the recombinant protein. 

The recombinant expression vectors may be introduced into host cells to produce a transformant host 
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cell. "Transformant host cells" include host cells which have been transformed or transfected with a 
recombinant expression vector of the invention. The terms "transformed with", "transfected with", 
"transformation" and "transfection" encompass the introduction of a nucleic acid (e.g. a vector) into a cell by 
one of many standard techniques. Prokaryotic cells can be transformed with a nucleic acid by, for example, 
5 electroporation or calcium-chloride mediated transformation. A nucleic acid can be introduced into mammalian 
cells via conventional techniques such as calcium phosphate or calcium chloride co-precipitation, DEAE- 
dextran-mediated transfection, lipofectin, electroporation or microinjection. Suitable methods for transforming 
and transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd 
Edition, Cold Spring Harbor Laboratory press (1989)), and other laboratory textbooks. 

1 0 Suitable host cells include a wide variety of prokaryotic and eukaryotic host cells. For example, the 

proteins of the invention may be expressed in bacterial cells such as E. coli, insect cells (using baculo virus), 
yeast cells, or mammalian cells. Other suitable host cells can be found in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1991). 

A host cell may also be chosen which modulates the expression of an inserted nucleic acid sequence, 

15 or modifies (e.g. glycosylation or phosphorylation) and processes (e.g. cleaves) the protein in a desired fashion. 
Host systems or cell lines may be selected which have specific and characteristic mechanisms for post- 
translational processing and modification of proteins. For example, eukaryotic host cells including CHO, 
VERO, BHK, HeLA, COS, MDCK, 293, 3T3, and WI38 may be used. For long-term high-yield stable 
expression of the protein, cell lines and host systems which stably express the gene product may be engineered. 

2 0 Host cells and in particular cell lines produced using the methods described herein may be particularly 

useful in screening and evaluating compounds that modulate the activity of a KLK15 Related Protein. 

The proteins of the invention may also be expressed in non-human transgenic animals including but 
not limited to mice, rats, rabbits, guinea pigs, micro-pigs, goats, sheep, pigs, non-human primates (e.g. baboons, 
monkeys, and chimpanzees) [see Hammer et al. (Nature 315:680-683, 1985), Palrniter et al. (Science 222:809- 
25 814, 1983), Brinster et al. (Proc Nad. Acad. Sci USA 82:44384442, 1985), Palrniter and Brinster (Cell 41 :343- 
345, 1985) and U.S. Patent No. 4,736,866)]. Procedures known in the art may be used to introduce a nucleic 
acid molecule of the invention encoding a KLK15 Related Protein into animals to produce the founder lines 
of transgenic animals. Such procedures include pronuclear microinjection, retrovirus mediated gene transfer 
into germ lines, gene targeting in embryonic stem cells, electroporation of embryos, and sperm-mediated gene 

3 0 transfer. 

The present invention contemplates a transgenic animal that carries the KLK15 gene in all their cells, 
and animals which carry the transgene in some but not all their cells. The transgene may be integrated as a 
single transgene or in concatamers. The transgene may be selectively introduced into and activated in specific 
cell types (See for example, Lasko et al, 1992 Proc. Natl. Acad. Sci. USA 89: 6236). The transgene may be 
3 5 integrated into the chromosomal site of the endogenous gene by gene targeting. The transgene may be 
selectively introduced into a particular cell type inactivating the endogenous gene in that cell type (See Gu et 
al Science 265: 103-106). 
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The expression of a recombinant KLK15 Related Protein in a transgenic animal may be assayed using 
standard techniques. Initial screening may be conducted by Southern Blot analysis, or PCR methods to analyze 
whether the transgene has been integrated. The level of mRNA expression in the tissues of transgenic animals 
may also be assessed using techniques including Northern blot analysis of tissue samples, in situ hybridization, 
5 and RT-PCR. Tissue may also be evaluated immunocytochemically using antibodies against KLK15 Protein. 

Proteins of the invention may also be prepared by chemical synthesis using techniques well known 
in the chemistry of proteins such as solid phase synthesis (Merrifield, 1964, J. Am. Chem. Assoc. 85:2149- 
2154) or synthesis in homogenous solution (Houbenweyl, 1987, Methods of Organic Chemistry, ed. E. 
Wansch, Vol. 15 I and II, Thieme, Stuttgart). 

1 0 N- terminal or C-terminal fusion proteins comprising a KLK15 Related Protein of the invention 

conjugated with other molecules, such as proteins, may be prepared by fusing, through recombinant techniques, 
the N-terminal or C-terminal of a KLK15 Related Protein, and the sequence of a selected protein or marker 
protein with a desired biological function. The resultant fusion proteins contain a KLK15 Protein fused to the 
selected protein or marker protein as described herein. Examples of proteins which may be used to prepare 

1 5 fusion proteins include immunoglobulins, glutathione-S-transferase (GST), hemagglutinin (HA), and truncated 
myc. 

3. Antibodies 

KLK15 Related Proteins of the invention can be used to prepare antibodies specific for the proteins. 
Antibodies can be prepared which bind a distinct epitope in an unconserved region of the protein. An 
2 0 unconserved region of the protein is one that does not have substantial sequence homology to other proteins. 
A region from a conserved region such as a well-characterized domain can also be used to prepare an antibody 
to a conserved region of a KLK15 Related Protein. Antibodies having specificity for a KLK15 Related 
Protein may also be raised from fusion proteins created by expressing fusion proteins in bacteria as described 
herein. 

2 5 The invention can employ intact monoclonal or polyclonal antibodies, and immunologically active 

fragments (e.g. a Fab, (Fab) 2 fragment, or Fab expression library fragments and epitope-binding fragments 
thereof), an antibody heavy chain, and antibody light chain, a genetically engineered single chain Fv molecule 
(Ladner et al, U.S. Pat. No. 4,946,778), humanized antibody, or a chimeric antibody, for example, an antibody 
which contains the binding specificity of a murine antibody, but in which the remaining portions are of human 

3 0 origin. Antibodies including monoclonal and polyclonal antibodies, fragments and chimeras, may be prepared 

using methods known to those skilled in the art. 

4. Applications of the Nucleic Acid Molecules, KLK15 Related Proteins, and Antibodies of the 
Invention 

The nucleic acid molecules, KLK15 Related Proteins, and antibodies of the invention may be used 
3 5 in the prognostic and diagnostic evaluation of disorders involving a KLK15 Related Protein (e.g. cancer or 
thyroid disorders), and the identification of subjects with a predisposition to such disorders (Section 4.1.1 and 
4.1.2). 
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In an embodiment of the invention, a method is provided for detecting the expression of the cancer 
marker KLK15 in a patient comprising: 

(a) taking a sample derived from a patient; and 

(b) detecting in the sample a nucleic acid sequence encoding KLK15 or a protein product encoded 
5 by a KLK15 nucleic acid sequence. 

In a particular embodiment of the invention, the nucleic acid molecules, KLK15 Related Proteins, and 
antibodies of the invention may be used in the diagnosis and staging of cancer, in particular prostate cancer. 
Increased levels of KLK15 Related Proteins are associated with more aggressive forms of prostate cancer and 
may be an indicator of poor prognosis. 

1 0 Methods for detecting nucleic acid molecules and KLK15 Related Proteins of the invention, can be 

used to monitor disorders involving a KLK15 Related Protein by detecting KLK15 Related Proteins and 
nucleic acid molecules encoding KLK15 Related Proteins. The applications of the present invention also 
include methods for the identification of compounds that modulate the biological activity of KLK15 Related 
Proteins (Section 4.2). The compounds, antibodies etc. may be used for the treatment of disorders involving 

15 a KLK15 Related Protein (Section 4.3), It would also be apparent to one skilled in the art that the methods 
described herein may be used to study the developmental expression of KLK15 Related Proteins and, 
accordingly, will provide further insight into the role of KLK15 Related Proteins. 
4.1 Diagnostic Methods 

A variety of methods can be employed for the diagnostic and prognostic evaluation of disorders 

2 0 involving a KLK15 Related Protein, and the identification of subjects with a predisposition to such disorders. 
Such methods may, for example, utilize nucleic acid molecules of the invention, and fragments thereof, and 
antibodies directed against KLK15 Related Proteins, including peptide fragments. In particular, the nucleic 
acids and antibodies may be used, for example, for: (1) the detection of the presence of KLK15 mutations, or 
the detection of either over- or under-expression of KLK15 mRNA relative to a non-disorder state or the 

2 5 qualitative or quantitative detection of alternatively spliced forms of KLK15 transcripts which may correlate 

with certain conditions or susceptibility toward such conditions; and (2) the detection of either an over- or an 
under-abundance of KLK15 Related Proteins relative to a non- disorder state or the presence of a modified 
(e.g., less than full length) KLK15 Protein which correlates with a disorder state, or a progression toward a 
disorder state. 

3 0 The methods described herein may be used to evaluate the probability of the presence of malignant 

or pre-malignant cells, for example, in a group of cells freshly removed from a host. Such methods can be used 
to detect tumors, quantitate their growth, and help in the diagnosis and prognosis of disease. The methods can 
be used to detect the presence of cancer metastasis, as well as confirm the absence or removal of all tumor 
tissue following surgery, cancer chemotherapy, and/or radiation therapy. They can further be used to monitor 
3 5 cancer chemotherapy and tumor reappearance. 

The methods described herein may be performed by utilizing pre-packaged diagnostic kits comprising 
at least one specific KLK15 nucleic acid or antibody described herein, which may be conveniently used, e.g., 
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in clinical settings, to screen and diagnose patients and to screen and identity those individuals exhibiting a 
predisposition to developing a disorder. 

Nucleic acid-based detection techniques are described, below, in Section 4.1.1. Peptide detection 
techniques are described, below, in Section 4.1.2. The samples that may be analyzed using the methods of the 
5 invention include those which are known or suspected to express KLK15 or contain KLK15 Related Proteins. 
The samples may be derived from a patient or a cell culture, and include but are not limited to biological fluids, 
tissue extracts, freshly harvested cells, and lysates of cells which have been incubated in cell cultures. 

Oligonucleotides or longer fragments derived from any of the nucleic acid molecules of the invention 
may be used as targets in a microarray. The microarray can be used to simultaneously monitor the expression 
10 levels of large numbers of genes and to identify genetic variants, mutations, and polymorphisms. The 
information from the microarray may be used to determine gene function, to understand the genetic basis of 
a disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents. 

The preparation, use, and analysis of microarrays are well known to a person skilled in the art. (See, 
for example, Brennan, T. M. et al. (1995) U.S. Pat. No. 5,474,796; Schena, et al. (1996) Proc. Natl. Acad. Sci. 
1 5 93:10614-10619; Baldeschweiler et al. (1995), PCT Application W095/251 116; Shalon, D. et al. (1 995) PCT 
application WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, M. J. 
et al. (1997) U.S. Pat. No. 5,605,662.) 

4.1,1 Methods for Detecting Nucleic Acid Molecules of the Invention 

The nucleic acid molecules of the invention allow those skilled in the art to construct nucleotide 
2 0 probes for use in the detection of nucleic acid sequences of the invention in samples. Suitable probes include 
nucleic acid molecules based on nucleic acid sequences encoding at least 5 sequential amino acids from regions 
of the KLK15 Protein, preferably they comprise 15 to 30 nucleotides (see SEQ ID Nos. 47-50). A nucleotide 
probe may be labeled with a detectable substance such as a radioactive label which provides for an adequate 
signal and has sufficient half-life such as 32 P, 3 H, 14 C or the like. Other detectable substances which may be 

2 5 used include antigens that are recognized by a specific labeled antibody, fluorescent compounds, enzymes, 

antibodies specific for a labeled antigen, and luminescent compounds. An appropriate label may be selected 
having regard to the rate of hybridization and binding of the probe to the nucleotide to be detected and the 
amount of nucleotide available for hybridization. Labeled probes may be hybridized to nucleic acids on solid 
supports such as nitrocellulose filters or nylon membranes as generally described in Sambrook et al, 1989, 

3 0 Molecular Cloning, A Laboratory Manual (2nd ed.). The nucleic acid probes may be used to detect genes, 

preferably in human cells, that encode KLK15 Related Proteins. The nucleotide probes may also be useful in 
the diagnosis of disorders involving a KLK15 Related Protein; in monitoring the progression of such disorders; 
or monitoring a therapeutic treatment. 

The probe may be used in hybridization techniques to detect genes that encode KLK15 Related 
3 5 Proteins. The technique generally involves contacting and incubating nucleic acids (e.g. recombinant DNA 
molecules, cloned genes) obtained from a sample from a patient or other cellular source with a probe of the 
present invention under conditions favorable for the specific annealing of the probes to complementary 
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sequences in the nucleic acids. After incubation, the non-annealed nucleic acids are removed, and the presence 
of nucleic acids that have hybridized to the probe if any are detected. 

The detection of nucleic acid molecules of the invention may involve the amplification of specific 
gene sequences using an amplification method such as PCR, followed by the analysis of the amplified 
5 molecules using techniques known to those skilled in the art. Suitable primers can be routinely designed by one 
of skill in the art. 

Genomic DNA may be used in hybridization or amplification assays of biological samples to detect 
abnormalities involving KLK15 structure, including point mutations, insertions, deletions, and chromosomal 
rearrangements. For example, direct sequencing, single stranded conformational polymorphism analyses, 
10 heteroduplex analysis, denaturing gradient gel electrophoresis, chemical mismatch cleavage, and 
oligonucleotide hybridization may be utilized. 

Genotyping techniques known to one skilled in the art can be used to type polymorphisms that are in 
close proximity to the mutations in a klkl5 gene. The polymorphisms may be used to identify individuals in 
families that are likely to carry mutations. If a polymorphism exhibits linkage disequalibrium with mutations 
15 in a KLK15 gene, it can also be used to screen for individuals in the general population likely to carry 
mutations. Polymorphisms which may be used include restriction fragment length polymorphisms (RFLPs), 
single-base polymorphisms, and simple sequence repeat polymorphisms (SSLPs). 

A probe of the invention may be used to directly identify RFLPs. A probe or primer of the invention 
can additionally be used to isolate genomic clones such as YACs, B ACs, PACs, cosmids, phage or plasmids. 
2 0 The DNA in the clones can be screened for SSLPs using hybridization or sequencing procedures. 

Hybridization and amplification techniques described herein may be used to assay qualitative and 
quantitative aspects of klkl5 expression. For example, RNA may be isolated from a cell type or tissue known 
to express klkl5 and tested utilizing the hybridization (e.g. standard Northern analyses) or PCR techniques 
referred to herein. The techniques may be used to detect differences in transcript size which may be due to 

2 5 normal or abnormal alternative splicing. The techniques may be used to detect quantitative differences between 

levels of full length and/or alternatively splice transcripts detected in normal individuals relative to those 
individuals exhibiting symptoms of a disorder involving a KLK15 Related Protein. 

The primers and probes may be used in the above described methods in situ i.e directly on tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections. 

3 0 4.1.2 Methods for Detecting KLK15 Related Proteins 

Antibodies specifically reactive with a KLK15 Related Protein, or derivatives, such as enzyme 
conjugates or labeled derivatives, may be used to detect KLK15 Related Proteins in various samples (e.g. 
biological materials). They may be used as diagnostic or prognostic reagents and they may be used to detect 
abnormalities in the level of KLK15 Related Protein expression, or abnormalities in the structure, and/or 
3 5 temporal, tissue, cellular, or subcellular location of a KLK15 Related Protein. Antibodies may also be used to 
screen potentially therapeutic compounds in vitro to determine their effects on disorders involving a KLK15 
Related Protein, and other conditions. In vitro immunoassays may also be used to assess or monitor the efficacy 
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of particular therapies. The antibodies of the invention may also be used in vitro to determine the level of 
KLK15 expression in cells genetically engineered to produce a KLK15 Related Protein. 

The antibodies may be used in any known immunoassays which rely on the binding interaction 
between an antigenic determinant of a KLK15 Related Protein and the antibodies. Examples of such assays are 
5 radioimmunoassays, enzyme immunoassays (e.g. ELISA), immunofluorescence, immunoprecipitation, latex 
agglutination, hemagglutination, and histochemical tests. The antibodies may be used to detect and quantify 
KLK15 Related Proteins in a sample in order to determine its role in particular cellular events or pathological 
states, and to diagnose and treat such pathological states. 

In particular, the antibodies of the invention may be used in immuno-histochemical analyses, for 

1 0 example, at the cellular and sub-subcellular level, to detect a KLK15 Related Protein, to localize it to particular 
cells and tissues, and to specific subcellular locations, and to quantitate the level of expression. 

Cytochemical techniques known in the art for localizing antigens using light and electron microscopy 
may be used to detect a KLK15 Related Protein. Generally, an antibody of the invention may be labeled with 
a detectable substance and a KLK15 Related Protein may be localised in tissues and cells based upon the 

1 5 presence of the detectable substance. Examples of detectable substances include, but are not limited to, the 
following: radioisotopes (e.g., 3 H, 14 C, 35 S, 125 I, 131 I), fluorescent labels (e.g., F1TC, rhodamine, lanthanide 
phosphors), luminescent labels such as luminol; enzymatic labels (e.g., horseradish peroxidase, beta- 
galactosidase, luciferase, alkaline phosphatase, acetylcholinesterase), biotinyl groups (which can be detected 
by marked avidin e.g., streptavidin containing a fluorescent marker or enzymatic activity that can be detected 

2 0 by optical or calorimetric methods), predetermined polypeptide epitopes recognized by a secondary reporter 
(e.g., leucine zipper pair sequences, binding sites for secondary antibodies, metal binding domains, epitope 
tags). In some embodiments, labels are attached via spacer arms of various lengths to reduce potential steric 
hindrance. Antibodies may also be coupled to electron dense substances, such as ferritin or colloidal gold, 
which are readily visualised by electron microscopy. 

2 5 The antibody or sample may be immobilized on a carrier or solid support which is capable of 

immobilizing cells, antibodies etc. For example, the carrier or support may be nitrocellulose, or glass, 
polyacrylamides, gabbros, and magnetite. The support material may have any possible configuration including 
spherical (e.g. bead), cylindrical (e.g. inside surface of a test tube or well, or the external surface of a rod), or 
flat (e.g. sheet, test strip). Indirect methods may also be employed in which the primary antigen-antibody 

3 0 reaction is amplified by the introduction of a second antibody, having specificity for the antibody reactive 

against KLK15 Related Protein. By way of example, if the antibody having specificity a KLK15 Related 
Protein is a rabbit IgG antibody, the second antibody may be goat anti-rabbit gamma-globulin labeled with a 
detectable substance as described herein. 

Where a radioactive label is used as a detectable substance, a KLK15 Related Protein may be 
3 5 localized by radioautography. The results of radioautography may be quantitated by determining the density 
of particles in the radio auto graphs by various optical methods, or by counting the grains. 

In an embodiment, the invention contemplates a method for monitoring the progression of cancer (e.g. 
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prostate cancer) in an individual, comprising: 

(a) contacting an amount of an antibody which binds to a KLK15 Related Protein, with a sample from 
the individual so as to form a binary complex comprising the antibody and KLK15 Related Protein 
in the sample; 

5 (b) determining or detecting the presence or amount of complex formation in the sample; 

(c) repeating steps (a) and (b) at a point later in time; and 

(d) comparing the result of step (b) with the result of step (c), wherein a difference in the amount of 
complex formation is indicative of the progression of the cancer in said individual. 

The amount of complexes may also be compared to a value representative of the amount of the 
1 0 complexes from an individual not at risk of, or afflicted with, cancer (e.g. prostate cancer). 
4.2 Methods for Identifying or Evaluating Substances/Compounds 

The methods described herein are designed to identify substances that modulate the biological activity 
of a KLK15 Related Protein including substances that bind to KLK15 Related Proteins, or bind to other 
proteins that interact with a KLK15 Related Protein, to compounds that interfere with, or enhance the 
15 interaction of a KLK 15 Related Protein and substances that bind to the KLK1 5 Related Protein or other 
proteins that interact with a KLK 15 Related Protein. Methods are also utilized that identify compounds that 
bind to KLK15 regulatory sequences. 

The substances and compounds identified using the methods of the invention include but are not 
limited to peptides such as soluble peptides including Ig-tailed fusion peptides, members of random peptide 
2 0 libraries and combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration amino 
acids, phosphopeptides (including members of random or partially degenerate, directed phosphopeptide 
libraries), antibodies [e.g. polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, single chain antibodies, 
fragments, (e.g. Fab, F(ab)2, and Fab expression library fragments, and epitope-binding fragments thereof)], 
and small organic or inorganic molecules. The substance or compound may be an endogenous physiological 

2 5 compound or it may be a natural or synthetic compound. 

Substances which modulate a KLK 15 Related Protein can be identified based on their ability to bind 
to a KLK 15 Related Protein. Therefore, the invention also provides methods for identifying substances which 
bind to a KLK15 Related Protein. Substances identified using the methods of the invention may be isolated, 
cloned and sequenced using conventional techniques. A substance that associates with a polypeptide of the 

3 0 invention may be an agonist or antagonist of the biological or immunological activity of a polypeptide of the 

invention. 

The term "agonist", refers to a molecule that increases the amount of, or prolongs the duration of, the 
activity of the protein. The term "antagonist" refers to a molecule which decreases the biological or 
immunological activity of the protein. Agonists and antagonists may include proteins, nucleic acids, 
3 5 carbohydrates, or any other molecules that associate with a protein of the invention. 

Substances which can bind with a KLK 15 Related Protein may be identified by reacting a KLK 15 
Related Protein with a test substance which potentially binds to a KLK15 Related Protein, under conditions 
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which permit the formation of substance-KLK 1 5 Related Protein complexes and removing and/or detecting 
the complexes. The complexes can be detected by assaying for substance-KLK 15 Related Protein complexes, 
for free substance, or for non-complexed KLK15 Related Protein. Conditions which permit the formation of 
substance-KLK15 Related Protein complexes may be selected having regard to factors such as the nature and 
5 amounts of the substance and the protein. 

The substance-protein complex, free substance or non-complexed proteins may be isolated by 
conventional isolation techniques, for example, salting out, chromatography, electrophoresis, gel filtration, 
fractionation, absorption, polyacrylamide gel electrophoresis, agglutination, or combinations thereof. To 
facilitate the assay of the components, antibody against KLK15 Related Protein or the substance, or labeled 

1 0 KLK15 Related Protein, or a labeled substance may be utilized. The antibodies, proteins, or substances may 
be labeled with a detectable substance as described above. 

A KLK15 Related Protein, or the substance used in the method of the invention may be insolubilized. 
For example, a KLK15 Related Protein, or substance may be bound to a suitable carrier such as agarose, 
cellulose, dextran, Sephadex, Sepharose, carboxymethyl cellulose polystyrene, filter paper, ion-exchange resin, 

15 plastic film, plastic tube, glass beads, polyamine-methyl vinyl-ether-maleic acid copolymer, amino acid 
copolymer, ethylene-maleic acid copolymer, nylon, silk, etc. The carrier may be in the shape of, for example, 
a tube, test plate, beads, disc, sphere etc. The insolubilized protein or substance may be prepared by reacting 
the material with a suitable insoluble carrier using known chemical or physical methods, for example, cyanogen 
bromide coupling. 

2 0 The invention also contemplates a method for evaluating a compound for its ability to modulate the 

biological activity of a KLK15 Related Protein of the invention, by assaying for an agonist or antagonist (i.e. 
enhancer or inhibitor) of the binding of a KLK15 Related Protein with a substance which binds with a KLK15 
Related Protein. The basic method for evaluating if a compound is an agonist or antagonist of the binding of 
a KLK15 Related Protein and a substance that binds to the protein, is to prepare a reaction mixture containing 

2 5 the KLK15 Related Protein and the substance under conditions which permit the formation of substance- 

KLK 15 Related Protein complexes, in the presence of a test compound. The test compound may be initially 
added to the mixture, or may be added subsequent to the addition of the KLK15 Related Protein and substance. 
Control reaction mixtures without the test compound or with a placebo are also prepared. The formation of 
complexes is detected and the formation of complexes in the control reaction but not in the reaction mixture 

3 0 indicates that the test compound interferes with the interaction of the KLK15 Related Protein and substance. 

The reactions may be carried out in the liquid phase or the KLK15 Related Protein, substance, or test 
compound may be immobilized as described herein. The ability of a compound to modulate the biological 
activity of a KLK15 Related Protein of the invention may be tested by determining the biological effects on 
cells. 

3 5 It will be understood that the agonists and antagonists i.e. inhibitors and enhancers that can be assayed 

using the methods of the invention may act on one or more of the binding sites on the protein or substance 
including agonist binding sites, competitive antagonist binding sites, non-competitive antagonist binding sites 
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or allosteric sites. 

The invention also makes it possible to screen for antagonists that inhibit the effects of an agonist of 
the interaction of KLK15 Related Protein with a substance which is capable of binding to the KLK15 Related 
Protein. Thus, the invention may be used to assay for a compound that competes for the same binding site of 
5 a KLK15 Related Protein. 

The invention also contemplates methods for identifying compounds that bind to proteins that interact 
with a KLK15 Related Protein. Protein-protein interactions may be identified using conventional methods such 
as co-immunoprecipitation, crosslinking and co-purification through gradients or chromatographic columns. 
Methods may also be employed that result in the simultaneous identification of genes which encode proteins 
1 0 interacting with a KLK15 Related Protein. These methods include probing expression libraries with labeled 
KLK15 Related Protein. 

Two-hybrid systems may also be used to detect protein interactions in vivo. Generally, plasmids are 
constructed that encode two hybrid proteins. A first hybrid protein consists of the DNA-binding domain of a 
transcription activator protein fused to a KLK15 Related Protein, and the second hybrid protein consists of 
1 5 the transcription activator protein's activator domain fused to an unknown protein encoded by a cDNA which 
has been recombined into the plasmid as part of a cDNA library. The plasmids are transformed into a strain 
of yeast (e.g. 5. cerevisiae) that contains a reporter gene (e.g. lacZ, luciferase, alkaline phosphatase, horseradish 
peroxidase) whose regulatory region contains the transcription activator's binding site. The hybrid proteins 
alone cannot activate the transcription of the reporter gene. However, interaction of the two hybrid proteins 

2 0 reconstitutes the functional activator protein and results in expression of the reporter gene, which is detected 

by an assay for the reporter gene product. 

It will be appreciated that fusion proteins may be used in the above-described methods. In particular, 
KLK15 Related Proteins fused to a glutathione-S-transf erase may be used in the methods. 

A modulator of a KLK15 Related Protein of the invention may also be identified based on its ability 
25 to inhibit or enhance catalytic activity of the protein. 

The reagents suitable for applying the methods of the invention to evaluate compounds that modulate 
a KLK15 Related Protein may be packaged into convenient kits providing the necessary materials packaged 
into suitable containers. The kits may also include suitable supports useful in performing the methods of the 
invention. 

3 0 4.3 Compositions and Treatments 

The proteins of the invention, substances or compounds identified by the methods described herein, 
antibodies, and nucleic acid molecules of the invention may be used for modulating the biological activity of 
a KLK15 Related Protein, and they may be used in the treatment of conditions such as cancer (particularly 
thyroid, prostate, colon, kidney, testicular cancer) and thyroid disorders in a patient. 
35 Accordingly, the substances, antibodies, peptides, and compounds may be formulated into 

pharmaceutical compositions for administration to subjects in a biologically compatible form suitable for 
administration in vivo. By "biologically compatible form suitable for administration in vivo" is meant a form 
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of the active substance to be administered in which any toxic effects are outweighed by the therapeutic effects. 
The active substances may be administered to living organisms including humans and animals. Administration 
of a therapeutically active amount of a pharmaceutical composition of the present invention is defined as an 
amount effective, at dosages and for periods of time necessary to achieve the desired result. For example, a 
5 therapeutically active amount of a substance may vary according to factors such as the disease state, age, sex, 
and weight of the individual, and the ability of antibody to elicit a desired response in the individual. Dosage 
regima may be adjusted to provide the optimum therapeutic response. For example, several divided doses may 
be administered daily or the dose may be proportionally reduced as indicated by the exigencies of the 
therapeutic situation. 

1 0 The active substance may be administered in a convenient manner such as by injection (subcutaneous, 

intravenous, etc.), oral administration, inhalation, transdermal application, or rectal administration. Depending 
on the route of administration, the active substance may be coated in a material to protect the substance from 
the action of enzymes, acids and other natural conditions that may inactivate the substance. 

The compositions described herein can be prepared by per se known methods for the preparation of 

1 5 pharmaceutically acceptable compositions which can be administered to subjects, such that an effective 
quantity of the active substance is combined in a mixture with a pharmaceutically acceptable vehicle. Suitable 
vehicles are described, for example, in Remington's Pharmaceutical Sciences (Remington's Pharmaceutical 
Sciences, Mack Publishing Company, Easton, Pa., USA 1985). On this basis, the compositions include, albeit 
not exclusively, solutions of the active substances in association with one or more pharmaceutically acceptable 

2 0 vehicles or diluents, and contained in buffered solutions with a suitable pH and iso-osmotic with the 
physiological fluids. 

The compositions are indicated as therapeutic agents either alone or in conjunction with other 
therapeutic agents or other forms of treatment (e.g. chemotherapy or radiotherapy). For example, the 
compositions may be used in combination with anti-proliferative agents, antimicrobial agents, 

2 5 immunostimulatory agents, or anti-inflammatories. In particular, the compounds may be used in combination 

with anti-viral and/or anti-proliferative agents. The compositions of the invention may be administered 
concurrently, separately, or sequentially with other therapeutic agents or therapies. 

Vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial 
plasmids, may be used to deliver nucleic acid molecules to a targeted organ, tissue, or cell population. Methods 

3 0 well known to those skilled in the art may be used to construct recombinant vectors which will express 

antisense nucleic acid molecules of the invention. (See, for example, the techniques described in Sambrook et 
al (supra) and Ausubel et al (supra)). 

The nucleic acid molecules comprising full length cDNA sequences and/or their regulatory elements 
enable a skilled artisan to use sequences encoding a protein of the invention as an investigative tool in sense 
3 5 (Youssoufian H and H F Lodish 1993 Mol Cell Biol 13:98-104) or antisense (Eguchi et al (1991) Annu Rev 
Biochem 60:631-652) regulation of gene function. Such technology is well known in the art, and sense or 
antisense oligomers, or larger fragments, can be designed from various locations along the coding or control 
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regions. 

Genes encoding a protein of the invention can be turned off by transfecting a cell or tissue with 
vectors which express high levels of a desired KLK15-encoding fragment. Such constructs can inundate cells 
with untranslatable sense or antisense sequences. Even in the absence of integration into the DNA, such vectors 
5 may continue to transcribe RNA molecules until all copies are disabled by endogenous nucleases. 

Modifications of gene expression can be obtained by designing antisense molecules, DNA, RNA or 
PNA, to the regulatory regions of a gene encoding a protein of the invention, i.e., the promoters, enhancers, 
and introns. Preferably, oligonucleotides are derived from the transcription initiation site, eg, between -10 and 
+10 regions of the leader sequence. The antisense molecules may also be designed so that they block translation 

10 of mRNA by preventing the transcript from binding to ribosomes. Inhibition may also be achieved using "triple 
helix" base-pairing methodology. Triple helix pairing compromises the ability of the double helix to open 
sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Therapeutic advances 
using triplex DNA were reviewed by Gee J E et al (In: Huber B E and B I Carr (1994) Molecular and 
Immunologic Approaches, Futura Publishing Co, Mt Kisco N.Y.). 

1 5 Ribozyrnes are enzymatic RNA molecules that catalyze the specific cleavage of RNA. Ribozymes act 

by sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. The invention therefore contemplates engineered hammerhead motif ribozyme 
molecules that can specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding a 
protein of the invention. 

2 0 Specific ribozyme cleavage sites within any potential RNA target may initially be identified by 

scanning the target molecule for ribozyme cleavage sites which include the following sequences, GUA, GUU 
and GUC. Once the sites are identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the target gene containing the cleavage site may be evaluated for secondary 
structural features which may render the oligonucleotide inoperable. The suitability of candidate targets may 
25 also be determined by testing accessibility to hybridization with complementary oligonucleotides using 
ribonuclease protection assays. 

Methods for introducing vectors into cells or tissues include those methods discussed herein and 
which are suitable for in vivo, in vitro and ex vivo therapy. For ex vivo therapy, vectors may be introduced into 
stem cells obtained from a patient and clonally propagated for autologous transplant into the same patient (See 

3 0 U.S. Pat. Nos. 5,399,493 and 5,437,994). Delivery by transfection and by liposome are well known in the art. 

An antibody against a KLK15 Related Protein may be conjugated to chemotherapeutic drugs, toxins, 
immunological response modifiers, hematogenous agents, enzymes, and radioisotopes and used in the 
prevention and treatment of cancer (e.g. thyroid, prostate, colon, kidney, testicular cancer). For example, an 
antibody against a KLK15 Related Protein may be conjugated to toxic moieties including but not limited to 
3 5 ricin A, diphtheria toxin, abrin, modeccin, or bacterial toxins from Pseudomonas or Shigella. Toxins and their 
derivatives have been reported to form conjugates with antibodies specific to particular target tissues, such as 
cancer or tumor cells in order to obtain specifically targeted cellular toxicity (Moolten F.L. et al, Immun. Rev. 
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62:47-72, 1982, andBernhard,, M.I. Cancer Res. 43:4420, 1983). 

Conjugates can be prepared by standard means known in the art. A number of Afunctional linking 

agents (e.g. heterobifunctional linkers such as N-succinimidyl-3-(2-pyridyldithio)propionate) are available 

commercially from Pierce Chemically Company, Rockford, 111. 
5 Administration of the antibodies or immunotoxins for therapeutic use may be by an intravenous route, 

although with proper formulation additional routes of administration such as intraperitoneal, oral, or 

transdermal administration may also be used. 

A KLK15 Related Protein may be conjugated to chemotherapeutic drugs, toxins, immunological 

response modifiers, enzymes, and radioisotopes using methods known in the art. 
1 0 The invention also provides immunotherapeutic approaches for preventing or reducing the severity 

of a cancer. The clinical signs or symptoms of the cancer in a subject are indicative of a beneficial effect to the 

patient due to the stimulation of the subject's immune response against the cancer. Stimulating an immune 

response refers to inducing an immune response or enhancing the activity of immunoeffector cells in response 

to administration of a vaccine preparation of the invention. The prevention of a cancer can be indicated by an 
1 5 increased time before the appearance of cancer in a patient that is predisposed to developing cancer due for 

example to a genetic disposition or exposure to a carcinogenic agent. The reduction in the severity of a cancer 

can be indicated by a decrease in size or growth rate of a tumor. 

Vaccines can be derived from a KLK Related Protein, peptides derived therefrom, or chemically 

produced synthetic peptides, or any combination of these molecules, or fusion proteins or peptides thereof. The 
2 0 proteins, peptides, etc. can be synthesized or prepared recombinantly or otherwise biologically, to comprise 

one or more amino acid sequences corresponding to one or more epitopes of a tumor associated protein. 

Epitopes of a tumor associated protein will be understood to include the possibility that in some instances 

amino acid sequence variations of a naturally occurring protein or polypeptide may be antigenic and confer 

protective immunity against cancer or anti-tumorigenic effects. Sequence variations may include without 

2 5 limitation, amino acid substitutions, extensions, deletions, truncations, interpolations, and combinations thereof. 

Such variations fall within the scope of the invention provided the protein containing them is immunogenic and 
antibodies against such polypeptide cross-react with naturally occurring KLK 15 Related Protein to a sufficient 
extent to provide protective immunity and/or anti-tumorigenic activity when administered as a vaccine. 

The proteins, peptides etc, can be incorporated into vaccines capable of inducing an immune response 

3 0 using methods known in the art. Techniques for enhancing the antigenicity of the proteins, peptides, etc. are 

known in the art and include incorporation into a multimeric structure, binding to a highly immunogenic protein 
carrier, for example, keyhole limpet hemocyanin (KLH), or diptheria toxoid, and administration in combination 
with adjuvants or any other enhancer of immune response. 

Vaccines may be combined with physiologically acceptable media, including immunologically 
3 5 acceptable diluents and carriers as well as commonly employed adjuvants such as Freund's Complete Adjuvant, 
saponin, alum, and the like. 

It will be further appreciated that anti-idiotype antibodies to antibodies to KLK 15 Related Proteins 
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described herein are also useful as vaccines and can be similarly formulated. 

The administration of a vaccine in accordance with the invention, is generally applicable to the 
prevention or treatment of cancers including thyroid, prostate, colon, kidney, and testicular cancer. 

The administration to a patient of a vaccine in accordance with the invention for the prevention and/or 
5 treatment of cancer can take place before or after a surgical procedure to remove the cancer, before or after a 
chemotherapeutic procedure for the treatment of cancer, and before or after radiation therapy for the treatment 
of cancer and any combination thereof. The cancer immunotherapy in accordance with the invention would be 
a preferred treatment for the prevention and /or treatment of cancer, since the side effects involved are 
substantially minimal compared with the other available treatments e.g. surgery, chemotherapy, radiation 
1 0 therapy. The vaccines have the potential or capability to prevent cancer in subjects without cancer but who are 
at risk of developing cancer. 

The activity of the proteins, substances, compounds, antibodies, nucleic acid molecules, agents^ and 
compositions of the invention may be confirmed in animal experimental model systems. Therapeutic efficacy 
and toxicity may be determined by standard pharmaceutical procedures in cell cultures or with experimental 
1 5 animals, such as by calculating the ED50 ( the dose therapeutically effective in 50% of the population) or LD 50 
(the dose lethal to 50% of the population) statistics. The therapeutic index is the dose ratio of therapeutic to 
toxic effects and it can be expressed as the ED50/LD50 ratio. Pharmaceutical compositions which exhibit large 
therapeutic indices are preferred. 
4.4 Other Applications 

2 0 The nucleic acid molecules disclosed herein may also be used in molecular biology techniques that 

have not yet been developed, provided the new techniques rely on properties of nucleotide sequences that are 
currently known, including but not limited to such properties as the triplet genetic code and specific base pair 
interactions. 

The invention also provides methods for studying the function of a polypeptide of the invention. 

2 5 Cells, tissues, and non-human animals lacking in expression or partially lacking in expression of a nucleic acid 

molecule or gene of the invention may be developed using recombinant expression vectors of the invention 
having specific deletion or insertion mutations in the gene. A recombinant expression vector may be used to 
inactivate or alter the endogenous gene by homologous recombination, and thereby create a deficient cell, 
tissue, or animal. 

3 0 Null alleles may be generated in cells, such as embryonic stem cells by deletion mutation. A 

recombinant gene may also be engineered to contain an insertion mutation that inactivates the gene. Such a 
construct may then be introduced into a cell, such as an embryonic stem cell, by a technique such as 
transfection, electroporation, injection etc. Cells lacking an intact gene may then be identified, for example 
by Southern blotting, Northern Blotting, or by assaying for expression of the encoded polypeptide using the 
3 5 methods described herein. Such cells may then be fused to embryonic stem cells to generate transgenic 
non-human animals deficient in a polypeptide of the invention. Germline transmission of the mutation may 
be achieved, for example, by aggregating the embryonic stem cells with early stage embryos, such as 8 cell 
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embryos, in vitro; transferring the resulting blastocysts into recipient females and; generating germline 
transmission of the resulting aggregation chimeras. Such a mutant animal may be used to define specific cell 
populations, developmental patterns and in vivo processes, normally dependent on gene expression. 

The invention thus provides a transgenic non-human mammal all of whose germ cells and somatic 
5 cells contain a recombinant expression vector that inactivates or alters a gene encoding a KLK15 Related 
Protein. In an embodiment the invention provides a transgenic non-human mammal all of whose germ cells 
and somatic cells contain a recombinant expression vector that inactivates or alters a gene encoding a KLK15 
Related Protein resulting in a KLK15 Related Protein associated pathology. Further, the invention provides 
a transgenic non-human mammal which does not express or has altered (e.g. reduced) expression of a KLK15 
1 0 Related Protein of the invention. In an embodiment, the invention provides a transgenic non-human mammal 
which does not express or has reduced expression of a KLK15 Related Protein of the invention resulting in a 
KLK15 Related Protein associated pathology. A KLK15 Related Protein pathology refers to a phenotype 
observed for a KLK15 Related Protein homozygous or heterozygous mutant. 

A transgenic non-human animal includes but is not limited to mouse, rat, rabbit, sheep, hamster, dog, 
1 5 cat, goat, and monkey, preferably mouse. 

The invention also provides a transgenic non-human animal assay system which provides a model 
system for testing for an agent that reduces or inhibits a pathology associated with a KLK15 Related Protein, 
preferably a KLK15 Related Protein associated pathology, comprising: 

(a) administering the agent to a transgenic non-human animal of the invention; and 
2 0 (b) determining whether said agent reduces or inhibits the pathology (e.g. KLK15 Related Protein 

associated pathology) in the transgenic non-human animal relative to a transgenic non-human 
animal of step (a) which has not been administered the agent. 
The agent may be useful in the treatment and prophylaxis of conditions such as cancer as discussed 
herein. The agents may also be incorporated in a pharmaceutical composition as described herein. 

2 5 The following non-limiting example is illustrative of the present invention: 

Example 

Materials and Methods 
Identification of the new gene 

A contiguous map for the human kallikrein gene locus extending from the KLK1 gene (centromere) 

3 0 to the KLK14 gene (telomere) (7,8,1 1,12,27) was constructed. Overlapping bacterial artificial chromosome 

(BAC) clones spanning this area were identified by screening of a human BAC library using different 
radiolabeled gene-specific probes. An area of ~ 300 kb of genomic sequence was established using different 
techniques, as previously described (1 1,27). By performing an EcoRl restriction analysis, the kallikrein locus 
was oriented along the EcoRl restriction map of chromosome 19ql3 available from the Lawrence Livermore 
3 5 National Laboratory (LLNL). A BAC clone that extends more centromerically (BC 781134) was then 
identified. Contigs of linear genomic sequences from this clone are available from the LLNL. Initially, these 
contig sequences were used to predict the presence of novel genes, using bioinformatic approaches, as 
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previously described (8,12), and a putative new serine protease was identified. The sequence of the putative 
gene was then verified by different approaches including sequencing, EST database search, PCR screening of 
tissues, as described below. 
Expressed sequence tag (EST) searching 
5 The predicted exons of the putative new gene were subjected to homology search using the BLASTN 

algorithm (28) on the National Center for Biotechnology Information web server (http://www 
ncbi.nlm.nih.gov/BLAST/) against the human EST database (dbEST). Clones with > 95% homology were 
obtained from the I.M. A.G.E. consortium (29) through Research Genetics Inc, Huntsville, AL. The clones were 
propagated, purified as described elsewhere (30) and sequenced from both directions with an automated 

1 0 sequencer, using insert-flanking vector primers. 

Prostate cancer cell line and hormonal stimulation experiments 

The LNCaP prostate cancer cell line was purchased from the American Type Culture Collection 
(ATCC), Rockville, MD. Cells were cultured in RPMI media (Gibco BRL, Gaithersburg, MD) supplemented 
with glutamine (200 mmol/L), bovine insulin (10 mg/L), fetal bovine serum (10%), antibiotics and 

1 5 antimycotics, in plastic flasks, to near conftuency. The cells were then aliquoted into 24-well tissue culture 
plates and cultured to 50% confluency. 24 hours before the experiments, the culture media were changed into 
phenol red-free media containing 10% charcoal-stripped fetal bovine serum. For stimulation experiments, 
various steroid hormones dissolved in 100% ethanol were added into the culture media at a final concentration 
of 10" 8 M. Cells stimulated with 100% ethanol were included as controls. The cells were cultured for 24 hours, 

2 0 then harvested for mRNA extraction. 

Reverse transcriptase polymerase chain reaction (RT-PCR) for the KLK15 gene 

Total RNA was extracted from the LNCaP cell line or from prostate tissues using Trizol reagent 
(Gibco BRL) following the manufacturer's instructions. RNA concentration was determined 
spectrophotometrically. 2 ug of total RNA was reverse-transcribed into first strand cDNA using the 

2 5 Superscript™ preamplification system (Gibco BRL). The final volume was 20 ul. Based on the combined 

information obtained from the predicted genomic structure of the new gene and the EST sequences (see below), 
two gene-specific primers were designed (KLK15-F1 - SEQ ID NO. 47 and KLK15-R1 - SEQ ID NO. 48 ) 
(Table 1) and PCR was carried out in a reaction mixture containing 1 ul of cDNA, 10 mM Tris-HCl (pH 8.3), 
50 mM KC1, 1.5 mM MgCl 2 , 200 uM dNTPs (deoxynucleoside triphosphates), 150 ng of primers and 2.5 units 

3 0 of HotStar™ DNA polymerase (Qiagen Inc., Valencia, CA) on a Perkin-Elmer 9600 thermal cycler. The 

cycling conditions were 95°C for 15 minutes to activate the Taq DNA polymerase, followed by 35 cycles of 
94°C for 30 s, 64°C for 30 s, 72°C for 1 min and a final extension step at 72°C for 10 min. Equal amounts of 
PCR products were electrophoresed on 2% agarose gels and visualized by ethidium bromide staining. All 
primers for RT-PCR spanned at least 2 exons to avoid contamination by genomic DNA. To verify the identity 
3 5 of the PCR products, they were cloned into the pCR 2.1-TOPO vector (Invitrogen, Carlsbad, CA, USA) 
according to the manufacturer's instructions. The inserts were sequenced from both directions using vector- 
specific primers, with an automated DNA sequencer. 
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Tissue expression 

Total RNA isolated from 26 different human tissues was purchased from Clontech, Palo Alto, CA. 
cDNA was prepared as described above for the tissue culture experiments and used for PCR reactions. Tissue 
cDNAs were amplified at various dilutions using two gene-specific primers (KLK15-F2 - SEQ ID NO. 49 and 
5 KLK15-R1 - SEQ ID NO. 48) (Table 1). Due to the high degree of homology between kallikreins, and to 
exclude non-specific amplification, PCR products were cloned and sequenced. 
Prostate cancer tissues 

Prostate tissue samples were obtained from 29 patients who had undergone radical retropubic 
prostatectomy for prostatic adenocarcinoma at the Charite University Hospital, Berlin, Germany. The patients 

1 0 did not receive any hormonal therapy before surgery. The use of these tissues for research purposes was 
approved by the Ethics Committee of the Charite Hospital. Fresh prostate tissue samples were obtained from 
the cancerous and non-cancerous parts of the same prostates that had been removed. Small pieces of tissue were 
dissected immediately after removal of the prostate and stored in liquid nitrogen until analysis. Histological 
analysis from all the tissue pieces was performed as previously described (31), to ensure that the tissue was 

15 either malignant or benign. The tissues were pulverized with a hammer under liquid nitrogen and RNA was 
extracted as described above, using Trizol reagent. 
Statistical analysis 

Statistical analysis was performed with SAS software (SAS Institute, Cary, NC). The analysis of 
differences between KLK15 expression in non-cancerous versus cancerous tissues from the same patient was 
2 0 performed with the non-parametric McNemar test. The binomial distribution was used to compute the 
significance level. Prostate tumor KLK15 mRNA levels were qualitatively classified into two categories 
(KLK15 - low and KLK15 - high groups) and associations between KLK15 status and other variables were 
analyzed using the Fisher's exact test. 
Structure analysis 

2 5 Multiple alignment was performed using the "Clustal X" software package and the multiple alignment 

program available from the Baylor College of Medicine, Houston, TX, USA. Phylo genetic studies were 
performed using the 'Thylip" software package. Distance matrix analysis was performed using the "Neighbor- 
Joining/UPGMA" program and parsimony analysis was done using the "Protpars" program. Hydrophobicity 
study was performed using the Baylor College of Medicine search launcher. Signal peptide was predicted using 

3 0 the "SignalP" server. Protein structure analysis was performed by "SAPS" (structural analysis of protein 

sequence) program. 
Results 

Cloning of the KLK15 gene 

A contiguous map for the human kallikrein gene locus extending from the KLK1 gene (centromere) 
35 to the KLK14 gene (telomere) was previously established (7,8,1 1,12,27). In order to investigate the presence 
of other kallikrein-like genes centromeric to KLK1, a BAC clone (BC 781134) was obtained as described in 
materials and methods. According to the published genomic sequence of prostate specific antigen (PSA), and 
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human renal kallikrein (KLK1) genes, gene-specific primers were designed for each of these genes (Table 1, 
SEQ ID NOs. 47-50) and polymerase chain reaction (PCR)-based amplification protocols were developed 
which allowed generation of specific PCR products with genomic DNA as a template. PGR screening of the 
BAC clone by these gene-specific primers indicated that this clone is positive for KLK1 but negative to PSA, 
5 thus, confirming its location to be centromeric to PSA. 

A putative new serine protease was predicted from the sequence of this clone by computer programs 
as previously described (12). This clone was digested, blotted on a membrane and hybridized with gene- 
specific primers for the putative KLK15 gene (according to the predicted sequence), and positive fragments 
were subcloned and sequenced to verify the structure of the putative gene. This putative gene sequence was 

10 then blasted against the human EST database and two EST clones were identified (GenBank accession 
#AW274270 and # AW205420). These two clones were 99% identical to the last exon and the 3 'untranslated 
region of the gene and the second EST ends with a stretch of 17 adenine (A) nucleotides that were not found 
in the genomic sequence, thus verifying the 3' end of the gene and the position of the poly A tail. 

To identify the full mRNA structure of the gene and to determine the exon/intron boundaries, PCR 

1 5 reactions were performed using primers located in different computer-predicted exons, using a panel of 26 
human tissue cDNAs as templates. PCR products were sequenced. Two of these primers (KLK15-F1 - SEQ 
ID NO. 47, and KLK15-R1 - SEQ ID NO. 48)) (Table 1) were able to amplify the full coding region of the 
gene from different tissues. Comparing the mRNA with the genomic structure indicated the presence of a gene 
formed of five coding exons with 4 intervening introns. Translation of the mRNA sequence in all possible 

2 0 reading frames revealed the presence of only one frame that gives an uninterrupted polypeptide chain, that also 
contains the highly conserved structural motifs of the kallikreins, as discussed below. 
Structural characterization of the KLK15 gene 

As shown in Figure 1, the KLK15 gene is formed of 5 coding exons and 4 intervening introns, 
although, as with other kallikrein genes, the presence of further upstream untranslated exon(s) could not be 

2 5 ruled out (17,32,33). All of the exon /intron splice sites conform to the consensus sequence for eukaryotic 

splice sites (34). The gene further follows strictly the common structural features of other members of the 
human kallikrein multigene family, as described below. The predicted protein-coding region of the gene is 
formed of 771 bp, encoding a deduced 256 amino acid polypeptide with a predicted molecular weight of 28.1 
kDa. The potential translation initiation codon matches the consensus Kozak sequence (35), moreover, there 

3 0 is a purine at position (-3) which occurs in 97% of vertebrate mRNAs (36). It should also be noted, that like 

most other kallikrein-like genes, KLK15 does not have the consensus G nucleotide at position (+4). 

Nucleotides 7764-7769 (ATT AAA) (SEQ ID NO. 40) closely resemble a consensus polyadenylation 
signal (37) and are followed, after 17 nucleotides, by the poly A tail. No other potential polyadenylation signals 
were discernable in the 3' untranslated region, suggesting that the above sequence is the actual polyadenylation 
3 5 signal. Although AATAAA (SEQ ID NO. 40) is highly conserved, natural variants do occur, and the ATTAAA 
(SEQ ID NO. 40) sequence is reported to occur as a natural polyadenylation variant in 12% of vertebrate 
mRNA sequences (38). The presence of glutamic acid (E) at position 203 suggests that KLK15 will likely 
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possess a unique substrate specificity. PSA has a serine (S) residue in the corresponding position and has 
chymotryptic-like activity. Many other kallikreins usually have aspartate (D) in this position indicating a 
trypsin-like activity (Figure 2) (6). 

Although the KLK15 protein sequence is unique, comparative analysis revealed that it has a 
5 considerable degree of homology with other members of the kallikrein multigene family. KLK15 shows 51% 
protein identity and 66% similarity with the trypsin like serine protease (TLSP) and 49%, 48% identity with 
the neuropsin and KLK-L3 proteins, respectively. Hydrophobicity analysis revealed that the amino-terminal 
region is quite hydrophobic (Figure 3), consistent with the possibility that this region may harbor a signal 
sequence, analogous to other serine proteases. Computer analysis of the KLK15 protein sequence predicted 

10 a cleavage site between amino acids 16 and 17 (TAA-QD). Sequence alignment (Figure 2) also revealed 
another potential cleavage site (Lys 21 ), at a site homologous to the activation site of other serine proteases 
[lysine (K) or arginine (R) is present in most cases] (39). Several evenly distributed hydrophobic regions 
throughout the KLK15 polypeptide are consistent with a globular protein, similar to other kallikreins and serine 
proteases. Thus, as is the case with other kallikreins, KLK15 is presumably translated as an inactive 256 amino 

1 5 acid preproenzyme precursor. Prepro-KLK15 has 21 additional residues which constitute the pre-region (the 
signal peptide formed of 16 residues), and the propeptide (5 residues). 

The dotted region in Figure 2 indicates an 1 1 -amino acid loop characteristic of the classical kallikreins 
(PSA, KLK1, and KLK2) but not found in KLK15 or other members of the kallikrein multi-gene family 
(10,11,13,14). However, KLK15 has a unique 8 amino acid loop (HNEPGTAG) (SEQ ID NO. 10) at positions 

2 0 148-155, not found in any other kallikrein (Figure 2). Twenty nine "invariant" amino acids surrounding the 
active site of serine proteases have been described (40). Of these, twenty eight are conserved in KLK15. One 
of the unconserved amino acids (Ser 173 instead of Pro) is also found in prostase, KLK-L2 and KLK-L5 proteins, 
and represents a conserved evolutionary change to a protein of the same group, according to protein evolution 
studies (41). Twelve cysteine residues are present in the putative mature KLK15 protein; ten of them are 

2 5 conserved in all kallikreins, and would be expected to form disulphide bridges. The other two (CI 3 1 and C243) 

are not found in PSA, KLK1, KLK2 or KLK-L4, however, they are found in similar positions in all other 
kallikrein genes and are expected to form an additional disulphide bond. 

To predict the phylogenetic relatedness of the KLK15 protein with other serine proteases, the amino 
acid sequences were aligned together using the Unweighted Pair Group Method with Arithmetic mean 

3 0 (UPGMA) and the Neighbor- Joining distance matrix methods, and the "Protpars" parsimony method. All 

phylogenetic trees obtained agreed that other serine proteases (non-kallikreins) can be grouped together as a 
separate group, indicating that kallikreins represent a separate step in the evolution of serine proteases. KLK15 
was grouped with the KLK-L3 and TLSP (Figure 4) and the classical kallikreins (hKl, hK2, and PSA) are 
grouped together in all trees, suggesting that the separation between classical kallikreins and the kallikrein-like 
3 5 genes occurred early during evolution, consistent with suggestions of previous studies (13). 
Splice variants of the KLK15 gene 

PCR screening for KLK15 transcripts using gene-specific primers (KLK15-F2 -SEQ ID NO. 49 and 



WO 02/14485 



PCT/CA01/01141 



-31 - 

KLK15-R2 - SEQ ID NO. 50) (Table 1) revealed the presence of 3 bands in most of the tissue cDNAs 
examined (Figure 6). These bands were gel purified, cloned and sequenced. The upper band represents the 
classical form of the gene, and the lower band is splice variant 3 (Figure 7). The middle band represents two 
other splice variants. Restriction digestion of the PCR product of the middle band with Stu I, followed by gel 
5 separation, purification, and sequencing revealed that it is composed of splice variants 1 and 2 which have 
approximately the same length (splice variant 1 has exon 4 (137 bp) but is missing 118 bp from exon 3, while 
splice variant 2 has an additional 118 bp of exon 3 but missing exon 4. All splice variants are expected to 
encode for truncated protein products (Figure 5). 
Chromosomal localization of the KLK15 gene 

1 0 Restriction analysis study of a number of overlapping B AC clones spanning the human kallikrein locus 

followed by comparison with the EcoRl restriction map of the area (available from the LLNL web site) enabled 
identification of a BAC clone (BC 25479) that is telomerically adjacent to BC 781134 (which harbors the 
KLK15 gene). Blasting the sequences of the two clones showed that the ends of these clones are overlapping. 
By identifying the position of the KLK1, KLK3 and KLK15 genes along these clones, the relative location and 

1 5 the direction of transcription of these three genes were precisely defined. KLK1 is the most centromeric and 
its direction of transcription is from telomere to centromere, followed by KLK15, which is more telomeric and 
transcribes in the same direction. The distance between the two genes is 1501 bp in length. The KLK3 gene 
is more telomeric, located at a distance of 23,335 from the KLK15, and is transcribed in the opposite direction 
(Figure 6). These results are consistent with previous reports where the distance between KLK3 and KLK1 was 

2 0 roughly estimated to be ~ 3 1 Kb (6,27). 

Tissue expression and hormonal regulation of the KLK15 gene 

As shown in Figure 7, the KLK15 gene is expressed at highest levels in the thyroid gland. Lower 
levels of expression are also seen in the prostate, salivary and adrenal glands, colon, testis and kidney. In order 
to verify the RT-PCR specificity, representative PCR products were cloned and sequenced. Figure 8 shows that 
25 the KLK15 gene is up-regulated by steroid hormones in the human LNCaP prostate cancer cell line. 
KLK15 expression in prostate cancer 

The expression of the KLK15 gene in normal and cancerous prostatic tissues was examined by RT- 
PCR. Actin was included as a control gene to ensure the quality and amount of the cDNA used. In order to 
examine the relative expression of the KLK15 gene in normal compared with malignant tissues, 29 pairs of 

3 0 prostatic tissues were examined. Each pair represented normal and cancerous tissue obtained from the same 

patient. The results are summarized in Table 2. Thirteen out of 29 patients had significantly higher KLK15 
expression in the cancer tissue and only three had the expression of KLK15 higher in non-cancer than to cancer 
tissues. Analysis by the McNemar test indicated that the differences between normal and cancerous tissues are 
statistically significant (P=0.021). Because of the small number of cases, the binomial distribution was used 
35 to compute the significance level. The prostate cancer patients were further classified into two groups: (a) 
KLK15 expression-positive (N= 21) and (b) KLK15 expression-negative (or very low) (N= 8). When the 
association of KLK15 expression was compared with clinicopathological prognostic variables higher KLK15 
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expression was found to be more frequent in patients with late stage disease and tumours of higher grade (Table 
3). 

Discussion 

Kallikreins are a subgroup of serine proteases. The term 'kallikrein' is usually utilized to describe an 
5 enzyme that acts upon a precursor molecule (kininogen) for release of a bioactive peptide (kinin)(3,42). 
However, the generic term 'tissue kallikrein' is not restricted to the functional definition of the enzyme. This 
term is now used to describe a group of enzymes with highly conserved gene and protein structure which also 
co-localize in the same chromosomal locus. Among the three classical human kallikrein genes, only KLK1 
encodes for a protein with potent kininogenase activity. The enzymes encoded by KLK2 and KLK3 genes have 
1 0 very weak kininogenase activity. The already cloned 14 members of the human kallikrein gene family have a 
number of similarities (7,11) as show below: 

• All genes localize to the same chromosomal region (19ql3.3-ql3.4) 

• All genes encode for putative serine proteases with a conserved catalytic triad in the appropriate positions, 
i.e., histidine near the end of the second coding exon, aspartic acid in the middle of the third exon, and serine 

15 at the beginning of the fifth (last) exon. 

• All genes have five coding exons (some members contain one or more 5/- untranslated exons). 

• Coding exon sizes are similar or identical. 

• Intron phases are fully conserved. 

• All genes have significant sequence homologies at the DNA and amino acid levels (30-80%). 
2 0 • Many of these genes are regulated by steroid hormones. 

Figures 2 and 8 show that the newly identified KLK15 gene shares all the above similarities and is 
thus a new member of the human kallikrein multigene family. This gene was named KLK15. 

Many kallikrein genes are related to the pathogenesis of human diseases, depending on the tissue of 
their primary expression. The KLK1 gene is involved in many disease processes, including inflammation (3), 

2 5 hypertension (44), renal nephritis and diabetic renal disease (45,46), The connections of HSCCE (KLK7) with 

skin diseases, including pathological keratinization and psoriasis, have already been reported (47,48). Little 
et al. suggested that zyme (KLK6) may be amyloidogenic and may play a role in the development of 
Alzheimer's disease (14). There are other reports describing connection of neuropsin (KLK8) expression with 
diseases of the central nervous system, including epilepsy (49,50). Being primarily expressed in the thyroid, 

3 0 KLK15 may play an important role in the normal physiology and pathophysiology of this gland. Among all 

other discovered kallikreins, many are expressed in the thyroid but none at highest levels in this tissue (7,1 1) 
The KLK15 gene is up-regulated, at the mRNA level, in a subset of prostate cancers. The distributions 
of KLK15 qualitative expression status (high or low) between subgroups of patients differing by disease stage, 
tumor grade and Gleason score indicated that high KLK15 expression was found more frequently in Grade 3 
3 5 tumors as well as in stage III and Gleason score >6 patients. These findings indicate that overexpression of 
KLK15 is associated with more aggressive forms of the disease and may be an indicator of poor prognosis 
(Table 3). 
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There is now growing evidence that many kallikreins and kallikrein-like genes are related to 
malignancy. PSA is the best marker for prostate cancer so far (20). Recent reports suggest that hK2 (encoded 
by the KLK2 gene) could be another useful diagnostic marker for prostate cancer (21,51). NES1 (KLK10) 
appears to be a novel tumor suppressor gene (23). The zyme (KLK6) gene was shown to be differentially 
5 expressed in primary breast and ovarian tumors (24), and the human stratum corneum chymotryptic enzyme 
(HSCCE, KLK7) has been shown to be expressed at abnormally high levels in ovarian cancer (25). Another 
recently identified kallikrein-like gene, tentatively named the tumor-associated differentially expressed gene- 14 
(TADG-14)/neuropsin) (KLK8) was found to be overexpressed in about 60% of ovarian cancer tissues (26). 
Pprostase/KLK-L 1/ (KLK4), another newly discovered kallikrein-like gene, is speculated to be linked to 

1 0 prostate cancer (13). Two newly discovered kallikreins, KLK-L4 (KLK13) and KLK-L5 (KLK12), were also 
found to be downregulated in breast cancer (10). Thus, extensive new literature suggests multiple connections 
of various kallikrein genes to many forms of human cancer. 

The existence of multiple alternatively spliced rnRNA forms is frequent among the kallikreins. 
Distinct RNA species are transcribed from the PSA gene, in addition to the major 1 .6 kb transcript (19,52,53). 

1 5 Also, Reigman et al reported the identification of two alternatively spliced forms of the human glandular 
kallikrein 2 (KLK2) gene (54). A novel transcript of the tissue kallikrein gene (KLK1) was also isolated from 
the colon (55). Neuropsin, a recently identified kallkrein-like gene, was found to have two alternatively spliced 
forms, in addition to the major form (26,56). KLK-L4 was also found to have different alternatively spliced 
forms (10). Because the splice variants of KLK15 have an identical 5' sequence required for translation, 

2 0 secretion and activation, it is possible to assume that they encode for a secreted protein (53). 

In conclusion, a new member of the human kallikrein gene family, KLK15, has been characterized 
which maps to the human kallikrein locus (chromosome 19ql3.3-ql3.4). This gene has three related splice 
forms in addition to the classical form. KLK15 is expressed in a variety of tissues but predominantly in the 
thyroid, it appears to be up-regulated in more aggressive forms of prostate cancer and its expression is 

2 5 influenced by steroid hormones. Since a few other kallikreins are already used as valuable tumor markers, 

KLK15 may also find similar clinical applications. 

Having illustrated and described the principles of the invention in a preferred embodiment, it should 
be appreciated to those skilled in the art that the invention can be modified in arrangement and detail without 

3 0 departure from such principles. All modifications coming within the scope of the following claims are claimed. 

All publications, patents and patent applications referred to herein are incorporated by reference in 
their entirety to the same extent as if each individual publication, patent or patent application was specifically 
and individually indicated to be incorporated by reference in its entirety. 
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Table 1. Primers used for genomic PGR amplification. 



Gene 


















GenBank 






Primer name and sequence 






accession # 


KLK1 


KLK1-A: 


ATC 


CCT 


CCA 


TTC 


CCA 


TCT 


TT 


L10038 




KLK1-B 


CAC 


ATA 


CAA 


TTC 


TCT 


GGT 


TC 




KLK2 


KLK2-A: 


AGT 


GAG 


ACT 


GTC 


TCA 


GAA 


TT 


M18157 




KLK2-B: 


CCC 


CAA 


TCT 


CAC 


GAG 


TGC 


AC 




PSA 


E5-A: 


GTC 


GGC 


TCT 


GGA 


GAC 


ATT 


TC 


M27274 




E5-B: 


AAC 


TGG 


GGA 


GGC 


TTG 


AGT 


C 




KLK15 


KLK15-F1 


CTC 


CTT 


CCT 


GCT 


GGC 


ATC 


CA 


AF242195 




KLK15-R1 


ATC 


ACA 


CGG 


GTG 


GTC 


ATG 


TG 






KLK15-F2 


CAA 


GTG 


GCT 


CTC 


TAC 


GAG 


CG 






KLK15-R2 


GAG 


ACC 


AGG 


CTT 


GGT 


GGT 


GT 





* all primers are presented in 5' — ► 3' direction. 
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Table 2. KLK15 expression in 29 pairs of cancerous and non -cancerous prostatic tissues. 


KLK15 Expression 


Number of patients P value * 


Higher in cancer vs. normal 


13 


Lower in cancer vs. normal 


3 


High expression but approx. equal in both 


8 


tissues 




Low (or no) expression but approx. equal in 


5 0.021 


both tissues 





P value was calculated by the McNemar test using the binomial distribution. 
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3. Relationship between KLK15 expression and other clinicopathological 
variables in 29 patients with primary prostate cancer 



Van ahlp 


Jratients 


No. of patients f%") 
KLK15 negative KLK15 positive 


P 










value* 


Stage 










i/ii 
in 

Grade 


20 
9 


8(40) 
0(0) 


12 (60) 
9 (100) 


0.033 


Gl/2 
G3 

Gleason score 


23 
6 


8 (34.8) 
0(0) 


15 (65.2) 
6 (100) 


0.15 


<6 
>6 

Unknown 


22 
6 
1 


7 (31.8) 
0(0) 


15 (62.2) 
6 (100) 


0.14 



*Fisher's Exact Test. 
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We Claim: 



1. An isolated KLK15 nucleic acid molecule of at least 30 nucleotides which hybridizes to one or more 
of SEQ. ID. NO. 1 through 5 or 10 through 24, or the complement of one or more of SEQ ID NO. 1 

5 through 5 or 10 through 24 under stringent hybridization conditions. 

2. An isolated nucleic acid molecule which comprises: 

(i) a nucleic acid sequence encoding a protein having substantial sequence identity with an amino 
acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; 

(ii) a nucleic acid sequence encoding a protein comprising an amino acid sequence of SEQ. ID, 
10 NO. 6, 7, 8, or 9; 

(iii) nucleic acid sequences complementary to (i) or (ii); 

(iv) a degenerate form of a nucleic acid sequence of (i) or (ii); 

(v) a nucleic acid sequence capable of hybridizing under stringent conditions to a nucleic acid 
sequence in (i), (ii) or (iii); 

1 5 (vi) a nucleic acid sequence encoding a truncation, an analog, an allelic or species variation of 

a protein comprising an amino acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; or 
(vii) a fragment, or allelic or species variation of (i), (ii) or (iii). 

3. An isolated nucleic acid molecule which comprises: 

(i) a nucleic acid sequence comprising the sequence of one or more of SEQ.ID.NO. 1 through 
2 0 5 or 10 through 24, wherein T can also be U; 

(ii) nucleic acid sequences complementary to (i), preferably complementary to the full nucleic 
acid sequence of one or more of SEQ.ID.NO. 1 through 5 or 10 through 24; 

(iii) a nucleic acid capable of hybridizing under stringent conditions to a nucleic acid of (i) or (ii) 
and preferably having at least 18 nucleotides; or 

2 5 (iv) a nucleic acid molecule differing from any of the nucleic acids of (i) to (iii) in codon 

sequences due to the degeneracy of the genetic code. 

4. A vector comprising a nucleic acid molecule of any of the preceding claims. 

5. A host cell comprising a nucleic acid molecule of any of the preceding claims. 

6. An isolated protein comprising an amino acid sequence of SEQ. ID. NO. 6, 7, 8, or 9. 

3 0 7. A method for preparing a protein as claimed in claim 6 comprising: 

(a) transferring a vector as claimed in claim 4 into a host cell; 

(b) selecting transformed host cells from untransformed host cells; 

(c) culturing a selected transformed host cell under conditions which allow expression of the protein; 
and 

3 5 (d) isolating the protein. 

8. A protein prepared in accordance with the method of claim 7. 

9. An antibody having specificity against an epitope of a protein as claimed in claim 6. 
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10. An antibody as claimed in claim 9 labeled with a detectable substance and used to detect the polypeptide 
in biological samples, tissues, and cells. 

11. A probe comprising a sequence encoding a protein as claimed in claim 6, or a part thereof. 

12. A method of diagnosing and monitoring a condition associated with a protein as claimed in claim 6 by 
5 determining the presence of a nucleic acid molecule as claimed in any of the preceding claims or a protein 

as claimed in any of the preceding claims. 

13. A method as claimed in claim 12 wherein the condition is cancer. 

14. A method for identifying a substance which associates with a protein as claimed in claim 6 comprising 

(a) reacting the protein with at least one substance which potentially can associate with the protein, 
1 0 under conditions which permit the association between the substance and protein, and 

(b) removing or detecting protein associated with the substance, wherein detection of associated 
protein and substance indicates the substance associates with the protein. 

15. A method for evaluating a compound for its ability to modulate the biological activity of a protein as 
claimed in claim 6 comprising providing the protein with a substance which associates with the protein 

1 5 and a test compound under conditions which permit the formation of complexes between the substance 

and protein, and removing and/or detecting complexes. 

16. A method for identifying inhibitors of a KLK15 Related Protein interaction, comprising 

(a) providing a reaction mixture including the KLK15 Related Protein and a substance that binds 
to the KLK15 Related Protein, or at least a portion of each which interact; 
2 0 (b) contacting the reaction mixture with one or more test compounds; 

(c) identifying compounds which inhibit the interaction of the KLK15 Related Protein and 
substance. 

17. A method for detecting a nucleic acid molecule encoding a protein comprising an amino acid sequence 
of SEQ. ID. NO. 6, 7, 8, or 9 in a biological sample comprising the steps of: 

2 5 (a) hybridizing a nucleic acid molecule of claim 1 to nucleic acids of the biological sample, 

thereby forming a hybridization complex; and 
(b) detecting the hybridization complex wherein the presence of the hybridization complex 
correlates with the presence of a nucleic acid molecule encoding the protein in the 
biological sample. 

3 0 18. A method as claimed in claim 17 wherein nucleic acids of the biological sample are amplified by the 

polymerase chain reaction prior to the hybridizing step. 
19. A method for monitoring the progression of cancer in an individual, comprising: 

(a) contacting an amount of an antibody which binds to a KLK15 Related Protein, with a sample from 
the individual so as to form a binary complex comprising the antibody and KLK15 Related Protein 

35 in the sample; 

(b) determining or detecting the presence or amount of complex formation in the sample; 

(c) repeating steps (a) and (b) at a point later in time; and 
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(d) comparing the result of step (b) with the result of step (c), wherein a difference in the amount of 
complex formation is indicative of the progression of cancer in said individual. 
20. A method for treating a condition mediated by a protein as claimed in claim 6 comprising administering 
an effective amount of a compound identified in accordance with a method claimed in claim 15 or 16. 
5 21 . A method as claimed in claim 20 wherein the condition is cancer. 

22. A composition comprising one or more of a nucleic acid molecule or protein claimed in any of the 
preceding claims, or a substance or compound identified using a method as claimed in any of the 
preceding claims, and a pharmaceutically acceptable carrier, excipient or diluent. 

23. Use of one or more of a nucleic acid molecule or protein claimed in any of the preceding claims, or a 
10 substance or compound identified using a method as claimed in any of the preceding claims in the 

preparation of a pharmaceutical composition for treating a condition mediated by a protein as claimed in 
claim 6, or a nucleic acid molecule as claimed in claim 1. 

24. A method of conducting a drug discovery business comprising: 

(a) providing one or more assay systems for identifying agents by their ability to inhibit or 
15 potentiate the interaction of a KLK15 Related Protein and a substance that binds to the 

KLK15 Related Protein; 

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, 
for efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step (b) 
2 0 as having an acceptable therapeutic profile. 

25. A vaccine for stimulating or enhancing in a subject to whom the vaccine is administered production of 
antibodies directed against a protein as claimed in claim 6. 

26. A method for stimulating or enhancing in a subject production of antibodies directed against a protein as 
claimed in claim 6. 

25 27. A method as claimed in claim 26 comprising administering to the subject a vaccine as claimed in claim 
25 in a dose effective for stimulating or enhancing production of the antibodies. 
28. A method for treating, preventing, or delaying recurrence of cancer comprising administering to the subject 
a vaccine as claimed in claim 25 in a dose effective for treating, preventing, or delaying recurrence of 
cancer. 

30 
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Figure 1 
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Figure 1 Cont'd 



CGCCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGG 
ACTTGTCCCACCTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGC 
TGATCCAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACAC 
TGTACACTGTTCCTTTTCACCCACCACCCCGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCT 
CCATTCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCTTG 
GTTTCCTTATCTGTAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTGAGGATTAAATG 
AGATGATTCGTCTGAACTGATTAAAATCGTGTCTGGCACTGA 
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Figure 2 
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Figure 4 
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SEQUENCE LISTING 



SEQ ID NO 1 



KLK15 full genomic structure 



1 agaatgggtg ctgtgggatt caggggagac 
61 ggtgggggca gagtgtcaga ggacaaagat 
121 caattcaccc tcaaagcttc ctgaggcctc 
181 ccagagtgga gtcagaccag ctcctcgggg 
241 cactgagggt caggggaggt cagagaaggc 
3 01 ggcgttggat atgcggtggg aggaatagcc 
3 61 agcactggct gcgtctagga caaggtcatg 
421 gaagccactt ttgtccttga aagtgaggct 
481 gcactttggg aggctgaggc gggtggatca 
541 ccaacatggt gaaactccgt ctctactaaa 
601 acacctgtaa tcccaattgc ttgggaggct 
661 cagaggttac agtgagcgga gatcacgcca 
721 ctccgtctca aaaaaaaaaa aaaaaaagaa 
781 tggactttat cctggtggtg ccaaggatcc 
841 gccagctcca gatgtagaaa gaccctttgg 
901 ggactggact ggagggggac ccaaaaggcc 
9 61 gtggctcatg cctgtaatcc cagcactttg 
1021 agagattgag gccatcctgg ctaacacggt 
1081 tttccaaaaa attagccggg cacggtggtg 
1141 tgaggcggga gaatggtgtg aacctgggag 
12 01 actgcactcc agcctgggtg acagagtgag 

12 61 gacagggggt ccttgcgtga tgatggagag 
1321 ggctgacagg tggaggaggt ggggcagggt 

13 81 ttccagagat gggggaccca gaaggaagcg 
1441 tggtcagaag aaatttatta acagtggatg 
1501 ggaggcagaa gggtcctgcc tcacctggat 
1561 accctggtcc ctcctccaca atgtggcttc 
1621 caggtgaggt ggccccagga gggggccagg 
1681 atgtctgggc ccagtgatct gccagcccct 
1741 cttcaccctc cacctgtgcc aatgtcccta 
1801 ctctggagtc gctatgttgc ctggggcctc 
1861 gggttacaca aacctgagct gagtcctggg 
1921 ggaactgctt cacctctctg ggcttcgaat 
1981 acaataacag tgaggtctca atagcataac 
2041 agtgctgaga ggccaacagc acagacagac 
2101 tgtctcttac agggtctttg aggggattaa 
2161 cctggcccag gtagtgcccc ttcaggtgtg 
2221 acaacgtcta tgtgtggtgg gcaccaccga 
2281 cagagggacg gccacaacat caaggtcacc 
2341 cagctgccac acaccaagaa attaattgta 
2401 gggtggactt tcattaacgt tctgccacaa 
2461 ccccacgcct ccagcctccc ctccggctct 
2521 tgcctgctgc gcccagctgg ggcctccacc 
2581 ctcggtgccc agctgttcag cccgtaccct 

2 641 tttgttaagc cccttgtcca ccacatccgc 
27 01 cagttcctga ctgtgatggg gagagtgtgg 
27 61 ctgtgtgggt gggagtgtgc atggcttcaa 
2821 ggagaggggt cctggtcctg tgaccttgaa 
2881 cattctgtgt gtgtgtgtgt gtgtgtgtgt 
2941 gccctgtatg ttcacatcct ctccacatgc 

3 001 tcagtgactc cacagatctg gcttgggggc 
3 061 ctggctgcct ctggcttcag ctgcatgtct 
3121 gtgctgcccc agacagcaag tccacatctg 
3181 ctctaactct acaaaatgtc ttgtgggtga 
3241 ttgtttggtt ttttttgaga cggagtctcg 
33 01 caacctcagc tcactgcaac ctccgcctcc 
33 61 gagtagctag aattacaggc acgcaccacc 
3421 tttatttatt tattttttag tagagacggg 
3481 actcctgacc tcaggtgatc cacccacctc 
3541 aggcgtgagc caccacacct ggccaacctt 
3601 tgtctgactt atgttggcca tctccaggat 



acctgttagg tgttggggcc tcccagaaga 
gaatttggaa gatatgggga agaaggattt 
ccgtgggtcg ggccctgcag tactggagac 
agctgccagt ctcgtagggg aggcagacac 
ctcaaggagg aagcggggct ggaagggaat 
taagcatgaa atggcaggag ggaaaatggc 
ggagacccag ggagaggggc tggaagggaa 
ggagccaggc aactcatgcc tgtaatccca 
ctagaggtca ggagttcaag accagcctgg 
attacaaaaa ttagctgggc gtggtggcac 
gaggcaggag aatctcttga acccagaagg 
ctccactcca acctgggcta cagagccaga 
aaaaaaagaa agaaagtgaa tttgaagagc 
atggagggtg gtgagcaggg gaggggcaca 
ggtcatggct ggagggcaag ctggtggagg 
agataagagg gttgagatag accaggcgcg 
ggaggccgag gtgggtggat catgaagtca 
gaaaccctgt ctctacttaa aaaaaaaaaa 
ggcgcctgta gtcccagcta ctcgggaggc 
gtggagcttg cagtgagccg acattgtgcc 
actccgtctc aaaaaaataa aaaaagttgg 
agatccaccc gctggtagca tggtgctgga 
ctgtccgagt gcctagagga agagtaaacc 
cagagtgggg ttgggggaag gggataccgg 
ggataagtct gtgtctggag ggatcctggt 
tctctcactc cctccccaga ctgcagccga 
tcctcactct ctccttcctg ctggcatcca 
tctgtgggag caggtgcccc cttcccaagc 
acctcaccca gagaccacta aagatccttc 
agcccttacc gtcaggtgct ggtgctgctg 
tcgctgccca cgacaaggaa cacggtcctg 
gcaaccgctt ccttgctgtg tgtccttgag 
gccttctcta taagacagca cccacttgag 
agaggtaata tacatagcaa gcattagaca 
tccagcttga gtcccacacc tgccactccc 
atgtggttgt gtgtgaggca gaagcataag 
caagccaggc acggtgctta gagcttacat 
cctcatttga caagggaagg ggctgtggct 
ttgggtgtca ggcaaactcc agattgaact 
acctgatgcc tctcttctgg agaaattggg 
atgaccctca ctcctggggg cccctgagac 
ctctgtgcac tcacctacct gcctcgcgcc 
ttcctctggc ttggactggc caggtgcagc 
ccgcccttcg gaggacgacc tcacccttcc 
attcccctgg tctcacgggg gcctttggcc 
gcatttggtc tggctgtgca aatcctgccc 
ccttcagggg atgcatccac attgcccagt 
tgtctctaat catgtcctta agcataatgc 
gtacatgcac gtgtgcagtg ggtatacaag 
atgagccaga tccccatatg tgaaacccaa 
tgatctagag atggataaat atgtcctgcc 
ttgaccttga atgcccagcc ccgtgtctgg 
agtgttggcc ttctgggttg gtgtctgcag 
atcacggttt taaccttgac ttttttttgt 
ctctgccgcc caagctggag ttcagtggtg 
caggttcaag caattctgtc tctgcctccc 
acgcccagct gatttttgta tttttattta 
atttcacgat gttggccagg ctggtctcaa 
ggccttggcc tcccaaagtg ctgggattac 
gactatttat tataggtaat tctgtgcaga 
ggacctgaac tttcacacgt atgtccctgt 
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3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 
6781 
6841 
6901 
6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 



gactaaatcc 
aggtatcact 
cttacaacaa 
actcaaagtt 
ccctgtatgc 
aggtctgaac 
tattcatgtc 
tttttttttt 
caatggagca 
ctcagcctcc 
tatttttgtt 
caggtgatcc 
cccagcctgt 
tccagtcctg 
atgaccttgt 
agactctggg 
gacaatttgg 
gtgcccctgt 
ctctctctcc 
cagcggttgt 
gcactttggg 
taacatggtg 
cacctcagct 
gcagtgagct 
caaaaacaac 
aaggcatagg 
cgtccctgtc 
gctggaaggt 
tggacgcttt 
ctgccaaagc 
agagctccta 
cgaggaggcc 
tctctgaagg 
agaatcctgg 
tctaggagtg 
ggcttcaggg 
tcgggttgcc 
tccactggcg 
aacctgcgca 
ccgcgctacg 
gcacgcctga 
gaggcctgtg 
agcccccggt 
atgctccagg 
ccactgtctt 
tcggacacat 
ggcgcggagg 
gaagaagagg 
ggccagagaa 
ggtgaggttg 
tagttagttt 
ggtgggaaat 
catgaagatt 
ggatgtgggc 
attggatata 

tgggttggct 

tgggatgggg 
atccaggagg 
acccctggtc 
caacaccacc 
aaccatgaag 
ccccacagct 
cctgtccaag 
caaggtcacc 
ttctgtgact 
cacccaccac 
aacacacatg 



aggtgtcatt 
caccatacat 
tcttcatttt 
tcacagctag 
aagtgtctgt 
aatatccaat 
ttaaccatcc 
tttttttttt 
acctccgctc 
agagtagctg 
agagacaggg 
acccgcctcg 
gctgtgattc 
tccatagctg 
atagccacat 
gccaaggctg 
tgaccgtgaa 
ctccaccccc 
cagagtctta 
aaataaacaa 
aggctgaggc 
agacctcgtc 
actcgggagg 
gagatcatgc 
aacaacaaca 
catatagtag 
ctcaggcatc 
gacgagtgtg 
aactgtggcg 
cggtatgaag 
gatttggggg 
ggatgtcaag 
aggaaggaga 
attcggggac 
tgcctgactt 
tcttgggaaa 
cactctttga 
ggaaaaccac 
agcgcgatgg 
aagcgcgcag 
acccccaggt 
tggtgtctgg 
cacaaggtgc 
gctcttgggc 
ccccagtgag 
cttgtgacaa 
gcagaggcgc 
gatggggaca 
gatgctaggg 
gagttggggt 
gagatggcat 
acgtcagggt 
gagattggat 
tgagttggat 
ggttgggtga 
ctgtttggga 
atggattggg 
tttcactcaa 
tgtgggggca 
aagcctggtg 
aggaactgac 
ggc cage age 
geccagatgt 
tgtttaatgc 
tttttctggg 
cccgatccct 
accacccgtg 
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tgcaaaaaac 
acacacatgc 
acagatgagg 
tacattcgaa 
gacactggat 
tctgtgtgtg 
atattcatat 
tttgagatgg 
actgeaatet 
ggattacagg 
tttccccata 
gcctcccaaa 
ttgaagctgc 
tacctaagtg 
ctgggactca 
ggtcacacat 
tgactggttt 
aaccacagag 
tagcaaatgg 
cagggcaggc 
gggcagagca 
tctactaaaa 
ctgaggcagg 
cactgcactc 
aaacaaaaaa 
ttagggcagg 
ctctacccct 
caccccactc 
cttccctcat 
gcaggggctc 
aagaeggagg 
cccctgggtt 
agactagttc 
agaccaggag 
cagactegtt 
aggtaatggg 
tctttctgtc 
tcgcccgcac 
cccagagcaa 
ccaccgcaac 
gcgccccgcg 
ctggggcctg 
gtgaaaggat 
ggaggggaca 
tctcccagat 
gagctaccca 
agaatcctgt 
ggtgtgggag 
ttaggcttgg 
tataggtggg 
gggttggggt 
tgaattggga 
tttgagatgg 
ttaacttagt 
gttgtattga 
taaactgggc 
tttggggtga 
cctgagaccc 
tectgeaggg 
tctataccaa 
tattctagee 
cccgcctgac 
tagecaagga 
caagataaca 
gtcaaagaga 
aggtgaggag 
tgatcttgaa 



aactaatatt 
acacacacac 
aaacagaggc 
cccaggctta 
gccaagactc 
tgtttgtgtg 
acacatatga 
agtttcactc 
ccgcctcccg 
cacccgccac 
ttggccaggc 
gtgctgggat 
aacccatgtg 
tggaagctgg 
tactgcacac 
actgagtggc 
caagtgacca 
gaettcttge 
ggtgggggct 
ggagcatggt 
tttgaagtca 
atacaaaaat 
aggatcactt 
cagcctgggc 
cagggcaggg 
gtgcaaggaa 
tctcttagca 
ccagccatgg 
ctccccacac 
agggtcctga 
cagacgccag 
aggaaggagt 
cagcttttga 
ggggctggga 
ggtccttgag 
atgtcgagat 
ctctacttgc 
agcttcatga 
ctacggacca 
gacatcatgt 
gtgctaccca 
gtgtcccaca 
ggagctggat 
agggceggaa 
aegttgeatt 
gggcgcctga 

gaggtcagag 
tccggatggg 
agatggagta 
gattgcgttg 
tgagaatggg 
tgaggtagat 
gcatgggttt 
acagttgeae 
gctgtgttga 
tgtattgagt 
gattgeaaat 
cctcttttcc 
cattgtgtcc 
agtctgccac 
tatctcctgt 
atggaacaga 
cttgtcccac 
aagegctgat 
aaccccgaga 
aagcggcttg 
caagaggccc 



attaagtagc 
atacacattc 
acagacaggt 
aggacccatc 
acactagaga 
tgcatgtgtg 
acatctgtgc 
ttgtcaccca 
ggttcaagcg 
catgcccagc 
tggtctcgaa 
tacaggcatg 
catgeaagtg 
atgtgcatgt 
tgaatttggc 
cacatgegtt 
cctgtctgaa 
cctctggtct 
agagttctgg 
ggctcacacc 
gaagtttgag 
tagecaggtg 
gaacccagga 
aaaagagtga 
tgtcttgaga 
ggtgtaggag 
geccaggatg 
caagtggctc 

tgggtgctgt 
gggagcctgg 
aactcctggg 
gtgtgtttca 
gcctcagttc 
gtagttggag 
gagcaggggc 
ttctaaaggg 
gggtaaccac 
gagtgegect 
cgtctcgggt 
tgctgcgcct 
cgcgttgccc 
aegagectgg 
gcgaggcctc 
tttatggatc 
gtgecaacat 
caaacaccat 
cctagagggg 
gttggatttt 
ggaagagaag 
tttgaggtgg 
aatggtttgg 
tttgtttgga 
gatttgattt 
tggagttgca 
attggggttg 
tgagttgggt 
ggtgattagg 
ccacagggtg 
tggggtgacg 
tacttggagt 
gcccctgact 
aeggagecat 
ctgaggacaa 
ecaagttget 
cactgtacac 
aagcagggct 
aatctcactt 



taccagggct 
ctacctcatc 
cgaataactt 
tttgtccaga 
tgttgaattt 
tgtgtgtatg 
tgtgattctt 
ggctggagtg 
attttcctgc 
taattttttg 
ctcctgacct 
agccaccgtg 
aatttcagct 
atgcatgtcc 
tgacatgtcc 
tgacgtctgt 
cctgtatcca 
gttccccttc 
agaaaacagg 
tgtaatccca 
actacctggc 
tggtggcggg 
ggcggaagtt 
gactccgtct 
agttagggga 
gcaatgtaaa 
gtgacaagtt 
tetacgageg 
ctgcggccca 
ttcgggggga 
ttctgaaaga 
aagecttega 
tagggatgtg 
gggatcgagt 
tggaaccatt 
tegggagace 
tggcccgcac 
gggagagcac 
cattccacac 
agtccagccc 
ccacccgggg 
gaccgctggg 
aaggaatcct 
tgctccaagt 
cagcattatc 
ggtgtgtgca 
ccatcaggcg 
etttgetttg 
ttagaatagg 
ataactgtga 
tttgattctg 
atgeagaaga 
tgaatggtga 
tgggggtgag 
gggttggggt 

tggggttccc 

atgaggatga 
actctggggg 
tcccttgtga 
ggatcaggga 
gagcagaagc 
cccccaagac 
agetggeget 
ctgtaggaat 
tgttcctttt 
ccattcattc 
cgccttggtt 
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7 681 tccttatctg taaaatgaga ccatcttatt gctgacttca aagggctgtt gtgaggatta 

7741 aatgagatga ttcgtctgaa ctgattaaaa tcgtgtctgg cactgagtaa ataccctcta 

7 801 tctctggatc ccagttaaag gacctaacag acactagatt accaagaatg gctttttctt 

7 861 taaggtttag ttctgggccg ggcatggtgg ctcacacctg taatcccagc actttgggag 

7921 gccaaggcgg gcggctcact tgaggtcagg agtgcaagac cagcctggcc aacatggtga 

7981 aaccccatct ctactaaaaa tactaaaaaa atttagccgg gcgtggtggc acacgactgt 

8041 aatcctagct acttgggagg gtgatgtggg aggatcgctt gaacttagga ggcaggagtt 

8101 gcagtgagcc gagatcgcgc cactgcactc cagcctggtg acagagcaag actccatctc 

8161 agaaaaaaaa aaaaaaaaaa aaagatttag ttctgggctt cctggtagcc atggcaaaaa 

8221 ggcaaatact gtcctttcct tagccaggtc cctgatatac agcagaggct ggaactctga 

82 81 gctgctttga ttttaccaaa aagccaagac aacctgttgg aagcctatgg gtttaccatt 

8341 gaggctgcag gaatctagtt cctaattatc ttcagagacc acaaaatgtg atgttcaagg 

8401 tcgctgaatg ttgaagtaca tgaacctggc tcgtgagacc taaatattgt actggtggtg 

8461 ggggggaagg gtcattggaa tctgtggtta gcctgatctt gacctgcgag ggaaggttgt 

8521 ccagatctct ggactttgga ggaccgacgt tgagcaccat aatgggagca gaagtgcgag 

8581 gtctttgaga ccccgcttgt tggggcggcg ccggatttgg atgctaaaaa ttacctggga 

8641 accctgaata catctgggtt gggcgcacaa tgtgtggctc cccacacatc tttaggaaca 

87 01 catttgggca acccggtggg agtgaacggc ctggc 



SEQ ID NO 2 

Classic mRNA (1581. .1623, 5259. .5412, 5913. .6196, 6317. .6453) 



ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTGACAAGTTGCTGGAAGGTG 
ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 
CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 
CTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 
GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGGTGCGCCCCGCGGTGCT 
ACCCACGCGTTGCCCCCACCCGGGGGAGGCCTGTGTGGTGTCTGGCTGGGGCCTGGTGTCCCACAACGAGCCTGGG 
ACCGCTGGGAGCCCCCGGTCACAAGTGAGTCTCCCAGATACGTTGCATTGTGCCAACATCAGCATTATCTCGGACA 
CATCTTGTGACAAGAGCTACCCAGGGCGCCTGACAAACACCATGGTGTGTGCAGGCGCGGAGGGCAGAGGCGCAGA 
ATCCTGTGAGGGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTC 
CCTTGTGACAACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGATCAGGGAAACCATGA 
AGAGGAACrGACTATTCTAGCCTATCTCCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCG 
CCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGGACTTGTCCCAC 
CTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGCTGATCCAAGTTGCTCTGTAG 
GAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACACTGTACACTGTTCCTTTTCACCCACCACCC 
CGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCTCCATTCATTCAACACACATGACCACCCGTGTGATCTTG 
AACAAGAGGCCCAATCTCACTTCGCCTTGGTTTCCTTATCTGTAAAATGAGACCATCTTATTGCTGACTTCAAAGG 
GCTGTTGTGAGGATTAAATGAGATGATTCGTCTGAACTGATTAAAATCGTGTCTGGCACTGA 

SEQ ID NO 3 

KLK 15 mRNA SPLICE VARIANT 1 structure (1581.. 1623, 5259.. 5412, 5913.. 6078, 
6317.. 6453) 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTGACAAGTTGCTGGAAGGTG 
ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 
CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 
CTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 
GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGTGAGTCTCCCAGATACG 
TTGCATTGTGCCAACATCAGCATTATCTCGGACACATCTTGTGACAAGAGCTACCCAGGGCGCCTGACAAACACCA 
TGGTGTGTGCAGGCGCGGAGGGCAGAGGCGCAGAATCCTGTGAGGGTGACTCTGGGGGACCCCTGGTCTGTGGGGG 
CATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACAACACCACCAAGCCTGGTGTCTATACCAAAGTC 
TGCCACTACTTGGAGTGGATCAGGGAAACCATGAAGAGGAACTGACTATTCTAGCCTATCTCCTGTGCCCCTGACT 
GAGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCGCCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTC 
CAAGGCCCAGATGTTAGCCAAGGACTTGTCCCACCTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCA 
AGATAACAAAGCGCTGATCCAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGA 
GACACTGTACACTGTTCCTTTTCACCCACCACCCCGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCTCCAT 
TCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCTTGGTTTCCTTATCTG 
TAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTGAGGATTAAATGAGATGA 



SEQ ID NO 4 
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KLK 15 mRNA SPLICE 2 structure (1581.. 1623, 5259.. 5412, 5913.. 6196, 
7127 . .7786) 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTGACAAGTTGCTGGAAGGTG 
ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 
CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 
CTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 
GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGGTGCGCCCCGCGGTGCT 
ACCCACGCGTTGCCCCCACCCGGGGGAGGCCTGTGTGGTGTCTGGCTGGGGCCTGGTGTCCCACAACGAGCCTGGG 
ACCGCTGGGAGCCCCCGGTCACAAGGGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTG 
TCCTGGGGTGACGTCCCTTGTGACAACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGA 
TC AGGGAAAC C ATGAAGAGGAAC TGACTATTC T AGC CT ATC T C C TGTGC C CC TGAC TGAGC AGAAG CC CC C AC AGC 
TGGCCAGCAGCCCCGCCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCC 
AAGGACTTGTCCCACCTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGCTGATC 
CAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACACTGTACACTGTTCCT 
TTTCACCCACCACCCCGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCTCCATTCATTCAACACACATGACC 
ACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCTTGGTTTCCTTATCTGTAAAATGAGACCATCTTAT 
TGCTGACTTCAAAGGGCTGTTGTGAGGATTAAATGAGATGA 



SEQ ID NO 5 

KLK 15 mRNA SPLICE 3 structure (1581.. 1623, 5259.. 5412, 5913.. 6078, 
7127 . .7786) 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTGACAAGTTGCTGGAAGGTG 
ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 
CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 
CTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 
GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGGGTGACTCTGGGGGACC 
CCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACAACACCACCAAGCCTGGT 
GTC T ATAC C AAAGT CTGC C AC TAC TTGGAGTGG ATC AGGGAAAC C ATGAAGAGGAAC TGAC TATTC TAGCC T ATC T 
CCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCGCCTGACATGGAACAGAACGGAGCCATC 
CCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGGACTTGTCCCACCTGAGGACAAAGCTGGCGCTCAAGGTC 
ACCTGTTTAATGCCAAGATAACAAAGCGCTGATCCAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTC 
AAAGAGAAAC C C C GAGAC AC TGT AC ACTGTTC C TTTTC AC C C AC C AC CCCGATC C CTAGGTGAGGAG AAGCGGC TT 
GAAGCAGGGCTCCATTCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCT 
TGGTTTCCTTATCTGTAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTGAGGATTAAATGAGATGA 



SEQ ID NO 6 
KLK15 protein 

MWLLLTLSFLLASTAAQDGDKLLEGDECAPHSQPWQVALYERGRFNCGASLISPHW\^ 

LRKRDGPEQLRTTSRVIPHPRYEARSHI^DIMLIiRLVQPARLNPQVRPAVLPTRCPHPGEACWSGWGLVSHNEPG 
TAGSPRSQVSLPDTLHCANISIISDTSCDKSYPGRLTNTWCAGAEGRGAESCEGDSGGPLVCGGILQGIVSWGDV 
PCDNTTKPGVYTKVCHYLEWIRETMKRN 



SEQ ID NO 7 

KLK15 splice variant 1 

MWLLLTLSFLLASTAAQDGDKLLEGDECAPHSQPWQVALYERGRFNCGASLISPHWVLSAAHCQSRFMRVRLGEHN 
LRKRDGPEQLRTTSRVIFHPRYEARSHRNDIMLLRLVQPARLNPQ 



SEQ ID NO 8 

KLklS splice variant 2 

MWLLLTLSFLLASTAAQDGDKLLEGDECAPHSQPWQVALYERGRFNCGASLISPHWVLSAAHCQSRFMRVRLGEHN 
LRKRDGPEQLRTTSRVIPHPRYEARSHRNDIMLLRLVQPARLNPQVTLPAVTiPTRCPHPGEACWSGWGLVSHNEPG 
TAGSPRSQG 
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SEQ ID NO 9 



KLkl5 splice variant 3 
MWLLLTLSFLLASTAAQDGDKLLEG 

LRKRDGPEQLRTTSRVIPHPRYEARSHRNDIMDDRLVQPARLNPQGDSGGPLVCGGILQGIVSWGDVPCDNTTKPG 
VYTKVCHYLEWIRETMKRN 



SEQ ID NO 10 

HNEPGTAG 

SEQ ID NO 11 

5 ' Untranslated 

1-1580 

AGAATGGGTGCTGTGGGATTCAGGGGAGACACCTGTTAGGTGTTGGGGCC 
TCCCAGAAGAGGTGGGGGCAGAGTGTCAGAGGACAAAGATGAATTTGGAA 
GATATGGGGAAGAAGGATTTCAATTCACCCTCAAAGCTTCCTGAGGCCTC 
CCGTGGGTCGGGCCCTGCAGTACTGGAGACCCAGAGTGGAGTCAGACCAG 
CTCCTCGGGGAGCTGCCAGTCTCGTAGGGGAGGCAGACACCACTGAGGGT 
CAGGGGAGGTCAGAGAAGGCCTCAAGGAGGAAGCGGGGCTGGAAGGGAAT 
GGCGTTGGATATGCGGTGGGAGGAATAGCCTAAGCATGAAATGGCAGGAG 
GGAAAATGGCAGCACTGGCTGCGTCTAGGACAAGGTCATGGGAGACCCAG 
GGAGAGGGGCTGGAAGGGAAGAAGCCACTTTTGTCCTTGAAAGTGAGGCT 
GGAGCCAGGCAACTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGC 
GGGTGGATCACTAGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGT 
GAAACTCCGTCTCTACTAAAATTACAAAAATTAGCTGGGCGTGGTGGCAC 
ACACCTGTAATCCCAATTGCTTGGGAGGCTGAGGCAGGAGAATCTCTTGA 
ACCCAGAAGGCAGAGGTTACAGTGAGCGGAGATCACGCCACTCCACTCCA 
ACCTGGGCTACAGAGCCAGACTCCGTCTCAAAAAAAAAAAAAAAAAAGAA 
AAAAAAAGAAAGAAAGTGAATTTGAAGAGCTGGACTTTATCCTGGTGGTG 
CCAAGGATCCATGGAGGGTGGTGAGCAGGGGAGGGGCACAGCCAGCTCCA 
GATGTAGAAAGACCCTTTGGGGTCATGGCTGGAGGGCAAGCTGGTGGAGG 
GGAC TGGAC TGG AGGGGG AC CC AAAAGGCC AGATAAGAGGGTTG AG ATAG 
ACCAGGCGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAG 
GTGGGTGGATCATGAAGTCAAGAGATTGAGGCCATCCTGGCTAACACGGT 
GAAACCCTGTCTCTACTTAAAAAAAAAAAATTTCCAAAAAATTAGCCGGG 
CACGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCGGGA 
GAATGGTGTGAACCTGGGAGGTGGAGCTTGCAGTGAGCCGACATTGTGCC 
ACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCCGTCTCAAAAAAATAA 
AAAAAGTTGGGACAGGGGGTCCTTGCGTGATGATGGAGAGAGATCCACCC 
GCTGGTAGCATGGTGCTGGAGGCTGACAGGTGGAGGAGGTGGGGCAGGGT 
C TGTC C GAGTGC C TAGAGGAAG AGTAAAC C TTC C AGAG ATGGGGGAC C C A 
GAAGGAAGCGCAGAGTGGGGTTGGGGGAAGGGGATACCGGTGGTCAGAAG 
AAATTTATTAACAGTGGATGGGATAAGTCTGTGTCTGGAGGGATCCTGGT 
GGAGGCAGAAGGGTCCTGCCTCACCTGGATTCTCTCACTCCCTCCCCAGA 
CTGCAGCCGAACCCTGGTCCCTCCTCCACA 

SEQ ID NO 12 

Exon 1 

1581-1623 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAG 
SEQ ID NO 13 



Intron 1 
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1624-5258 

GTGAGGTGGCCCCAGGAGGGGGCCAGGTCTGTGGGAGCAGGTGCCCCCTTCCCAAGCATGTCTGGGCCCAGTGATC 
TGCCAGCCCCTACCTCACCCAGAGACCACTAAAGATCCTTCCTTCACCCTCCACCTGTGCCAATGTCCCTAAGCCC 
TTACCGTCAGGTGCTGGTGCTGCTGCTCTGGAGTCGCTATGTTGCCTGGGGCCTCTCGCTGCCCACGACAAGGAAC 
ACGGTCCTGGGGTTACACAAACCTGAGCTGAGTCCTGGGGCAACCGCTTCCTTGCTGTGTGTCCTTGAGGGAACTG 
CTTCACCTCTCTGGGCTTCGAATGCCTTCTCTATAAGACAGCACCCACTTGAGACAATAACAGTGAGGTCTCAATA 
GCATAACAGAGGTAATATACATAGCAAGCATTAGACAAGTGCTGAGAGGCCAACAGCACAGACAGACTCCAGCTTG 
AGTCCCACACCTGCCACTCCCTGTCTCTTACAGGGTCTTTGAGGGGATTAAATGTGGTTGTGTGTGAGGCAGAAGC 
ATAAGCCTGGCCCAGGTAGTGCCCCTTCAGGTGTGCAAGCCAGGCACGGTGCTTAGAGCTTACATACAACGTCTAT 
GTGTGGTGGGCACCACCGACCTCATTTGACAAGGGAAGGGGCTGTGGCTCAGAGGGACGGCCACAACATCAAGGTC 
ACCTTGGGTGTCAGGCAAACTCCAGATTGAACTCAGCTGCCACACACCAAGAAATTAATTGTAACCTGATGCCTCT 
CTTCTGGAGAAATTGGGGGGTGGACTTTCATTAACGTTCTGCCACAAATGACCCTCACTCCTGGGGGCCCCTGAGA 
CCCCCACGCCTCCAGCCTCCCCTCCGGCTCTCTCTGTGCACTCACCTACCTGCCTCGCGCCTGCCTGCTGCGCCCA 
GCTGGGGCCTCCACCTTCCTCTGGCTTGGACTGGCCAGGTGCAGCCTCGGTGCCCAGCTGTTCAGCCCGTACCCTC 
CGCCCTTCGGAGGACGACCTCACCCTTCCTTTGTTAAGCCCCTTGTCCACCACATCCGCATTCCCCTGGTCTCACG 
GGGGCCTTTGGCCCAGTTCCTGACTGTGATGGGGAGAGTGTGGGCATTTGGTCTGGCTGTGCAAATCCTGCCCCTG 
TGTGGGTGGGAGTGTGCATGGCTTCAACCTTCAGGGGATGCATCCACATTGCCCAGTGGAGAGGGGTCCTGGTCCT 
GTGACCTTGAATGTCTCTAATCATGTCCTTAAGCATAATGCCATTCTGTGTGTGTGTGTGTGTGTGTGTGTGTACA 
TGCACGTGTGCAGTGGGTATACAAGGCCCTGTATGTTCACATCCTCTCCACATGCATGAGCCAGATCCCCATATGT 
GAAACCCAATCAGTGACTCCACAGATCTGGCTTGGGGGCTGATCTAGAGATGGATAAATATGTCCTGCCCTGGCTG 
CCTCTGGCTTCAGCTGCATGTCTTTGACCTTGAATGCCCAGCCCCGTGTCTGGGTGCTGCCCCAGACAGCAAGTCC 
ACATCTGAGTGTTGGCCTTCTGGGTTGGTGTCTGCAGCTCTAACTCTACAAAATGTCTTGTGGGTGAATCACGGTT 
TTAACCTTGACTTTTTTTTGTTTGTTTGGTTTTTTTTGAGACGGAGTCTCGCTCTGCCGCCCAAGCTGGAGTTCAG 
TGGTGCAACCTCAGCTCACTGCAACCTCCGCCTCCCAGGTTCAAGCAATTCTGTCTCTGCCTCCCGAGTAGCTAGA 
ATTACAGGCACGCACCACCACGCCCAGCTGATTTTTGTATTTTTATTTATTTATTTATTTATTTTTTAGTAGAGAC 
GGGATTTCACGATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTTGGCCTCC 
CAAAGTGCTGGGATTACAGGCGTGAGCCACCACACCTGGCCAACCTTGACTATTTATTATAGGTAATTCTGTGCAG 
ATGTCTGACTTATGTTGGCCATCTCCAGGATGGACCTGAACTTTCACACGTATGTCCCTGTGACTAAATCCAGGTG 
T C ATTTGC AAAAAAC AACTAAT ATT ATTAAGTAGC T AC C AGGGCTAGGT ATC AC TC AC C ATAC AT AC AC AC ATGC A 
CACACACACATACACATTCCTACCTCATCCTTACAACAATCTTCATTTTACAGATGAGGAAACAGAGGCACAGACA 
GGTC GAAT AACTTAC TC AAAGTTTC AC AGC T AGT AC ATTC GAAC C C AGGC TT AAGG AC C C ATCTTTGT C C AG AC C C 
TGTATGCAAGTGTCTGTGACACTGGATGCCAAGACTCACACTAGAGATGTTGAATTTAGGTCTGAACAATATCCAA 
TTCTGTGTGTGTGTTTGTGTGTGCATGTGTGTGTGTGTATGTATTCATGTCTTAACCATCCATATTCATATACACA 
TATGAACATCTGTGCTGTGATTCTTTTTTTTTTTTTTTTTTTTTTTTTGAGATGGAGTTTCACTCTTGTCACCCAG 
GCTGGAGTGCAATGGAGCAACCTCCGCTCACTGCAATCTCCGCCTCCCGGGTTCAAGCGATTTTCCTGCCTCAGCC 
TCCAGAGTAGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTTTGTATTTTTGTTAGAGACAGGGTTT 
CCCCATATTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGA 
TTACAGGCATGAGCCACCGTGCCCAGCCTGTGCTGTGATTCTTGAAGCTGCAACCCATGTGCATGCAAGTGAATTT 
CAGCTTCCAGTCCTGTCCATAGCTGTACCTAAGTGTGGAAGCTGGATGTGCATGTATGCATGTCCATGACCTTGTA 
TAGCCACATCTGGGACTCATACTGCACACTGAATTTGGCTGACATGTCCAGACTCTGGGGCCAAGGCTGGGTCACA 
CATACTGAGTGGCCACATGCGTTTGACGTCTGTGACAATTTGGTGACCGTGAATGACTGGTTTCAAGTGACCACCT 
GTCTGAACCTGTATCCAGTGCCCCTGTCTCCACCCCCAACCACAGAGGACTTCTTGCCCTCTGGTCTGTTCCCCTT 
CCTCTCTCTCCCAGAGTCTTATAGCAAATGGGGTGGGGGCTAGAGTTCTGGAGAAAACAGGCAGCGGTTGTAAATA 
AACAACAGGGCAGGCGGAGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGAGCAT 
TTG AAGTC AGAAGTTTG AG AC T AC C TGGC T AAC ATGGTGAGAC C TC GTC TCTAC TAAAAATAC AAAAATT AGC C AG 
GTGTGGTGGCGGGCACCTCAGCTACTCGGGAGGCTGAGGCAGGAGGATCACTTGAACCCAGGAGGCGGAAGTTGCA 
GTGAGCTGAGATCATGCCACTGCACTCCAGCCTGGGCAAAAGAGTGAGACTCCGTCTCAAAAACAACAACAACAAC 
AAAACAAAAAACAGGGCAGGGTGTCTTGAGAAGTTAGGGGAAAGGCATAGGCATATAGTAGTTAGGGCAGGGTGCA 
AGGAAGGTGTAGGAGGCAATGTAAACGTCCCTGTCCTCAGGCATCCTCTACCCCTTCTCTTAG 

SEQ ID NO 14 

Exon 2 

5259-5412 

CAGCCCAGGATGGTGACAAGTTGCTGGAAGGTGACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTA 
CGAGCGTGGACGCTTTAACTGTGGCGCTTCCCTCATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGC 
CG 

SEQ ID NO 15 
Intron 2 



5413-5912 
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GTATGAAGGCAGGGGCTCAGGGTCCTGAGGGAGCCTGGTTCGGGGGGAAGAGCTCCTAGATTTGGGGGAAGACGGA 
GGCAGACGCCAGAACTCCTGGGTTCTGAAAGACGAGGAGGCCGGATGTCAAGCCCCTGGGTTAGGAAGGAGTGTGT 
GTTTCAAAGCCTTCGATCTCTGAAGGAGGAAGGAGAAGACTAGTTCCAGCTTTTGAGCCTCAGTTCTAGGGATGTG 
AGAATCCTGGATTCGGGGACAGACCAGGAGGGGGCTGGGAGTAGTTGGAGGGGATCGAGTTCTAGGAGTGTGCCTG 
ACTTCAGACTCGTTGGTCCTTGAGGAGCAGGGGCTGGAACCATTGGCTTCAGGGTCTTGGGAAAAGGTAATGGGAT 
GTCGAGATTTCTAAAGGGTCGGGAGACCTCGGGTTGCCCACTCTTTGATCTTTCTGTCCTCTACTTGCGGGTAACC 
ACTGGCCCGCACTCCACTGGCGGGAAAACCACTCGCCCGCACAG 



SEQ ID NO 16 

Exon 3 (Classic and Splice Variant 2) 



5913-6196 

cttcatga gagtgcgcct gggagagcac aacctgcgca agcgcgatgg cccagagcaa ctacggacca 
cgtctcgggt cattccacac ccgcgctacg aagcgcgcag ccaccgcaac gacatcatgt tgctgcgcct 
agtccagccc gcacgcctga acccccaggt gcgccccgcg gtgctaccca cgcgttgccc ccacccgggg 
gaggcctgtg tggtgtctgg ctggggcctg gtgtcccaca acgagcctgg gaccgctggg 
agcccccggt cacaag 

SEQ ID NO 17 

Intron 3 - Classic and Spice Variant 2 
6197-6316 

tgc gtgaaaggat ggagctggat gcgaggcctc aaggaatccta tgctccaggg ctcttgggcg 
gaggggaca agggccggaa tttatggatc, tgctccaagtccactgtctt ccccag 



SEQ ID NO 18 

Exon 3 - (Splice variant 1 and 3) 
5913-6078 

CTTCATGAGAGTGCGCCTGGGAGAGCACAACCTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGG 
GTCATTCCACACCCGCGCTACGAAGCGCGCAGCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCAC 
GCCTGAACCCCCAGGTGCGCCCCGCGGTGCTACCCACGCGTTGCCCCCACCCGGGGGAGGCCTGTGTGGTGTCTGG 
CTGGGGCCTGGTGTCCCACAACGAGCCTGGGACCGCTGGGAGCCCCCGGTCACAAG 



SEQ ID NO 19 

Intron 3 - (Splice variant 1 and 3) 
6079-6316 

GTGCGTGAAAGGATGGAGCTGGATGCGAGGCCTCAAGGAATCCTATGCTCCAGGGCTCTTGGGCGGAGGGGACAAG 
GGCCGGAATTTATGGATCTGCTCCAAGTCCACTGTCTTCCCCAG 

SEQ ID NO 20 

Exon 4 

6317-6453 (Classic and Splice Variant 1) 

TGAGTCTCCCAGATACGTTGCATTGTGCCAACATCAGCATTATCTCGGACACATCTTGTGACAAGAGCTACCCAGG 
GCGCCTGACAAACACCATGGTGTGTGCAGGCGCGGAGGGCAGAGGCGCAGAATCCTGTGAG 

SEQ ID NO 21 

Intron 4 (Classic and Splice Variant 1) 
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6454-7126 

GTCAGAGCCTAGAGGGGCCATCAGGCGGAAGAAGAGGGATGGGGACAGGTGTGGGAGTCCGGATGGGGTTGGATTT 
TCTTTGCTTTGGGCCAGAGAAGATGCTAGGGTTAGGCTTGGAGATGGAGTAGGAAGAGAAGTTAGAATAGGGGTGA 
GGTTGGAGTTGGGGTTATAGGTGGGGATTGCGTTGTTTGAGGTGGATAACTGTGATAGTTAGTTTGAGATGGCATG 
GGTTGGGGTTGAGAATGGGAATGGTTTGGTTTGATTCTGGGTGGGAAATACGTCAGGGTTGAATTGGGATGAGGTA 
GATTTTGTTTGGAATGCAGAAGACATGAAGATTGAGATTGGATTTTGAGATGGGCATGGGTTTGATTTGATTTTGA 
ATGGTGAGGATGTGGGCTGAGTTGGATTTAACTTAGTACAGTTGCACTGGAGTTGCATGGGGGTGAGATTGGATAT 
AGGTTGGGTGAGTTGTATTGAGCTGTGTTGAATTGGGGTTGGGGTTGGGGTTGGGTTGGCTCTGTTTGGGATAAAC 
TGGGCTGTATTGAGTTGAGTTGGGTTGGGGTTCCCTGGGATGGGGATGGATTGGGTTTGGGGTGAGATTGCAAATG 
GTGATTAGGATGAGGATGAATCCAGGAGGTTTCACTCAACCTGAGACCCCCTCTTTTCCCCACAG 



SEQ ID NO 22 

Intron 4 (Splice variant 2 and 3) 
6079-7126 

T GCGCCCCGCG GTGCTACCCA CGCGTTGCCC CCACCCGGGG GAGGCCTGTG TGGTGTCTGG 
CTGGGGCCTG GTGTCCCACA ACGAGCCTGG GACCGCTGGG AGCCCCCGGT CACAAGGTGC GTGAAAGGAT 
GGAGC TGGAT GCGAGGCCTC AAGGAATC CTATGC TC C AGG GCTCTTGGGC GGAGGGGACA AGGGCCGGAA 
TTTATGGATC TGCTCCAAGT CCACTGTCTT CCCCAGTGAG TCTCCCAGAT ACGTTGCATT GTGCCAACAT 
CAGCATTATC TCGGACACAT CTTGTGACAA GAGCTACCCA GGGCGCCTGA CAAACACCAT GGTGTGTGCA 
GGCGCGGAGG GCAGAGGCGC AGAATCCTGT GAGGTCAGAG CCTAGAGGGG CCATCAGGCG GAAGAAGAGG 
ATGGGGACA GGTGTGGGAG TCCGGATGGG GTTGGATTTT CTTTGCTTTG GGCCAGAGAA GATGCTAGGG 
TTAGGCTTGG AGATGGAGTA GGAAGAGAAG TTAGAATAGG GGTGAGGTTG GAGTTGGGGT TATAGGTGGG 
GATTGCGTTG TTTGAGGTGG ATAACTGTGA TAGTTAGTTT GAGATGGCAT GGGTTGGGGT TGAGAATGGG 
AATGGTTTGG TTTGATTCTG GGTGGGAAAT ACGTCAGGGT TGAATTGGGA TGAGGTAGAT TTTGTTTGGA 
ATGCAGAAGA CATGAAGATT GAGATTGGAT TTTGAGATGG GCATGGGTTT GATTTGATTT TGAATGGTGA 
GGATGTGGGC TGAGTTGGAT TTAACTTAGT ACAGTTGCAC TGGAGTTGCA TGGGGGTGAG ATTGGATATA 
GGTTGGGTGA GTTGTATTGA GCTGTGTTGA ATTGGGGTTG GGGTTGGGGT TGGGTTGGCT CTGTTTGGGA 
TAAACTGGGC TGTATTGAGT TGAGTTGGGT TGGGGTTCCC TGGGATGGGG ATGGATTGGG TTTGGGGTGA 
GATTGCAAAT GGTGATTAGG ATGAGGATGA ATCCAGGAGG TTTCACTCAA CCTGAGACCC CCTCTTTTCC 
CCACAG 



SEQ ID NO 23 
Exon 5 
7127-7786 

GGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACA 
ACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGATCAGGGAAACCATGAAGAGGAACTG 
ACTATTCTAGCCTATCTCCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCGCCTGACATGG 
AACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGGACTTGTCCCACCTGAGGACAA 
AGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGCTGATCCAAGTTGCTCTGTAGGAATTTCTGT 
GACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACACTGTACACTGTTCCTTTTCACCCACCACCCCGATCCCTAG 
GTGAGGAGAAGCGGCTTGAAGCAGGGCTCCATTCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGC 
CCAATCTCACTTCGCCTTGGTTTCCTTATCTGTAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTGA 
GGATTAAATGAGATGATTCGTCTGAACTGATTAAAATCGTGTCTGGCACTGA 



SEQ ID NO 24 
7127-7279 

GGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACA 
ACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGATCAGGGAAACCATGAAGAGGAACTG 
A 



SEQ ID NO. 25 
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Zyme 

MKKLMWLSLIAAAWAEEQ 

LRQRESSQEQSSVVRAVIHPDYDAASHDQDIMLLRLARPAKLSELIQPLPLERDCSANTTSCHILGWGKTADGDFP 
DTIQCAYIHLVSREECEHAYPGQITQNMLCAGDEKYGKDSCQGDSGGPLVCGDHLRGLVSWGNIPCGSKEKPGVyT 
NVCRYTNWIQKTIQAK 



SEQ ID NO. 26 
KLK-L4 

lyMPLALVIASLTLALSGGVSQESSKVLNTNGTSGFLPGGYTCFPHSQPWQAALLVQGRLLCGGVLVHPKWVLTAM 
CLKEGLKVYLGKHALGRVEAGEQVREVVHSIPHPEYRRSPTHIjNHDHDIML^ 

PGTTCRVSGWGTTTSPQVNYPKTLQCANIQLRSDEECRQWPGKITDl^LCAGTKEGGKDSCEGDSGGPWCimTL 
YGIVSWGDFPCGQPDRPGVYTRVSRYVLWIRETIRKYETQQQKWLKGPQ 

SEQ ID NO. 27 

KLK-L6 

MFLLLTALQVLAI AMTQSQEDENKI IGGHTCTRS S QPWQAALiL A 

GPRRRFLCGGALLSGQWVITAAHCGRPILQVALGKHNLRRWEATQQVLRVVRQVTHPNY 

ARIGRAVRPIEVTQACASPGTSCRVSGWGTISSPIARYPASIiQCVNINISPDEVCQKAYPRTITPGMVCAGVPQGG 
KDSCQGDSGGPLVCRGQLQGLVSWGMERCALPGYPGVYTNLCKYRSWIEETMRDK 

SEQ ID NO. 28 

TLSP 

MQRLRWLRDWKSSGRGLTAAKEPGARSSPLQAMRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTR 
LLCGATLIAPRWLLTAAHCLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPV 
SITWAVRPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTMVCASVQEGGK 
DSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKVCKYVDWIQE 



SEQ ID NO. 29 
KLK-L3 

MKLGLLCALLSLLAGHGWADTRAIGAEECRPNSQPWQAGLFHLTRLFCGATLISDRWLLTAAHCRKPYLWVRLGEH 
HLWKWEGPEQLFRVTDFFPHPGFNKDLSANDHNDDIMLIRLPRQARLSPAVQPLNLSQTCVSPGMQCLISGWGAVS 
SPKALFPVTLQCANISILENKLCHWAYPGHISDSMLCAGLWEGGRGSCQGDSGGPLVCNGTLAGWSGGAEPCSRP 
RRP AVYT S VCH YLDW I QE IMEN 



SEQ ID NO. 30 



NES1 



MRAPHLHLSAASGARALAKLLPLLMAQLWAAEAALLPQNDTRLDPEAYGAPCARG SQPWQVSLFNGLSFH 
CAGVLVDQSWVLTAAHCGNKPLWARVGDDH LL-LLQG-EQLRRTT RSWHPKYHQGSGPI LPRRTDEHDLML 
LKLARPW- PGPRVR ALQLPYR-CAQPGDQ C QVAGWGTT AARRVK YNKGLTCSSITILSP 
KECEVFYPGWTNNM ICAGLDR-GQDPCQS DSGGPDVCDETLQGI LSWG- 
VYPCGSAQHPAVYTQICKYMSWINK VIRSN 



SEQ ID NO. 31 
KLK-L5 



MGLSIFLLLCVLGLSQAATPKIFNGTECGRNSQPWQVGLFEGTSLRCGGVLIDHRWVL.TAAHCSGSRYWVRLGEHS 
LSQbDWTEQIRHSGFSVTHPGYLGASTSHEHDLRLLRLRLPVRVTSSVQPLPLPNDCATAGTECHVSGWGITNHPR 
NPFPDLLQCLNLSIVSHATCHGWPGRITSNIWCAGGVPGQDACQGDSGGPLVCGGVLQGLVSWGSVGPCGQDGIP 
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GVYTYICKYVDWIRMIMRNN 



SEQ ID NO. 32 
Neuropsin 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQ^^ 

PKYTTOLGDHSLQNKDGPEQEIPWQSIPHPCYNSSDVEDHlslHDLMLLQLRDQASLGSKVKPISLADHCTQPGQKC 
TVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCEDAYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSW 
GSDPCGRSDKPGVYTNICRYLDWIKKIIGSKG 

SEQ ID NO. 33 

PSA 

MOTPWFLTLSVTWIGAAPLILSRIVGGWECEKHSQPW 

RHSDFHPEDTGQVFQVSHSFPHPLYDMSLLKNRFLRPGDDSSHDLMLLRLSEPAELTDAVKVMDLPTQEPALGTTC 
YASGWGSIEPEEFLTPKKLQCVDLHVISNDVCAQVHPQKVTKFMLCAGRWTGGKSTCSGDSGGPLVCNGVLQGITS 
WGSEPCALPERPSLYTKWHYRKWIKDTIVANP 



SEQ ID NO. 34 

HK2 

M\ATDIjVLSIALSVGCTGAVPLIQSRIVGGWECEKH^ 
RHNLFEPEDTGQRVPVSHSFPHPLYNMSLLKHQSLRPDEDSSHDM^ 

YASGWGSIEPEEFLRPRSLQCVSLHLLSNDMCARAYSEKVTEFMLCAGLWTGGKDTCGGDSGGPLVCNGVLQGITS 
WGPEPCALPEKPAVYTKWHYRKWIKDTIAANP 



SEQ ID NO. 35 
HK1 

MWFLVLCLALSLGGTGAAPPIQSRIVGGWECEQHSQPWQAALYHFSTFQCGGILVHRQWVLTAAHCISDNYQLWLG 
RHNLFDDENTAQFVHVSESFPHPGFNMSLLENHTRQADEDYSHDLMLLRLTEPADTITDAVKVVELPTEEPEVGST 
CLASGWGSIEPENFSFPDDLQCVDLKILPNDECKKAHVQKVTDFMLCVGHLEGGKDTCVGDSGGPLMCDGVLQGVT 
SWGYVPCGTPNKPSVAVRVLSYVKWIEDTIAENS 

SEQ ID NO. 3 6 



KLK-L2 

MATARPPWMWVLCALITALLL^^ 

TQPWQAALLLRPNQLYCGAVLWPQWLLTAAHCRKKVFRVRLGHYSLSPWESGQQMFQGVKSIPHPGYSHPGHSN 
DLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDT 
MFCAGDKAGRDSCQGDSGGPWCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 



SEQ ID NO. 37 
prostase 



MAT AGN PWGWFLG YL> I LGVAG S L VS G S C S Q I INGE DC S PH S Q PWQ AALVMENELF C S GVLVH PQWVL S AAHCF QNS 
YTIGLGLHSLEADQEPGSQMVEASLSVRHPEYNRPLLANDLMLIKLDESVSSDTIRSISIASQCPTAGNSCLVSGW 
GDLANGRMPTVLQCVNVSWSEEVCSKLYDPLYHPSMFCAGGGHDQKDSCNGDSGGPLICNGYLQGLVSFGKAPCG 
QVGVPGVYTNL CKF TE WI EKTVQ AS 



SEQ ID NO. 38 
HSCCE 
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MARSLLLPLQILLLSLALETAGEEAQGDKIIDGAPCARGSHPWQVALLSGNQLHCHSCCEGGVLVNERWVLTAAHC 
KMNEYTTOLGSDTLGDRRAQRIKASKSFRHPGYSTQTHWDLMLWLNSQARLSSMVKKVRLPSRCEPPGTTCTVS 
GWGTTTSPDVTFPDLMCVDVTCLISPQDCTKVYKDLLENSMLCAGIPDSKKNACNGDSGGPLVCRGTLQGLVS 
WGTFPCGQPNDPGWTQVCKFTKWINDTMKKHR 

SEQ ID NO 39 

CACAACGAGCCTGGGACCGCTGGG 

SEQ ID NO 40 
ATTAAA 

SEQ ID NO 41 

Table 1 KLKl-A 

ATCCCTCCATTCCCATCTTT 

SEQ ID NO 42 

Table 1 KLK1-B 

CACATACAATTCTCTGGTTC 

SEQ ID NO 43 

Table 1 KLK2-A 

AGTGACACTGTCTCAGAATT 

SEQ ID NO 44 

Table 1 KLK2-B 

CCCCAATCTCACGAGTGCAC 

SEQ ID NO 45 
Table 1 E5-A 
GTCGGCTCTGGAGACATTTC 

SEQ ID NO 46 
Table 1 E5-B 
AACTGGGGAGGCTTGAGTC 

SEQ ID NO 47 

Table 1 KLK15-F1 

CTCCTTCCTGCTGGCATCCA 

SEQ ID NO 48 
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Table 1 KLK15-R1 
ATCACACGGGTGGTCATGTG 
SEQ ID NO 49 
Table 1 KLK15-F2 
CAAGTGGCTCTCTACGAGCG 
SEQ ID NO 50 
Table 1 KLK15-R2 
GACACCAGGCTTGGTGGTGT 



